FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
T CAT AC AT T AC TGC AG T AAC AC T C CAC C AT AT AGACCCGGC T T TACC T T ATATCAG T GAC ACT GG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATTTATGTTCGT TATAAGCAAGT TCATGC TCTGAGT CC TGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTTTGCACAGTGGCAATTTTGGGACTGATTTAGAACAGAAACTCCATTGGAACCCCGAGG 
ACAAAGG T TAT GT GCT T CACAT GATCAC T AC TGC AGC AGAAT GGTC TAT GT CATT TTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
GAGATATT TGATGAAAGGATAAAATATT TCT GTAATGAT TAT GAT TCTCAGGGATTGGGGAAAGG 
TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
GATGAATGCT GATAAT CAGGAAACATGAAAGAAGCCAT T TGATAGAT TAT TC TAAAGGATATCAT 
CAAGAAGACTATTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACT 
ATG 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLS FL PSAL VI WT S AAF I FS Y I T AVTLHH I DPALPY I S DTGT VAPEKCLFGAMLN I AAV 
LC I AT I YVRYKQVHALS PEENV I 1 KLNKAGLVLGI L S CLGLS I VANFQKT T L FAAHVS GAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 
N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE 3 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCC ATGA GTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACAT GGGATCTT ATCCAT AGAACAGC T CAT CAGCCGGGTTGGTGT GAT T GGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
CATAAGCAAAAAGAAAAGGATGGCAAT GGCAC GGAGAACAATGT T CCAGAAGGGGGAAGT GCATAACA 
AACCATCAGGTT T CT GGGGAATGATAAAAAGT GTTACCACT TCAGCATCAGGAAGTGAAAATCTTACT 
C T T AT TCAACAGGAAGT GGAT GC T T TG GAAGAAT T AAGC AG G CAGCT T T TT C T GGAAACAG CT GATC T 
ATATGCTACCAAGGAGAGAATAGAATAC T CCAAAACCT T CAAGGGGAAATATTTTAATTT T CTTGGTT 
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
GGGAAAACGGAT CCT GTCACAAGAGGCAT TGAGAT CACTGT GAAT TATCTGGGAATCCAAT TT GATGT 
GAAGTTTT GGTCCCAACACAT TT CCTT CATTC TTGTTGGAATAAT CATCGT CACATCCAT CAGAGGAT 
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCT TGA AC TTAAGCCTACTACAGACT GTTAGAGGCCAGTGGT TTCAAAAT TTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTC AGCAGAGAGCAT C CCGT GTGGATAT GAGGCTGG 
T GT AGAGG C GGAGAG GAGC CAAGAAAC T AAAG G T GAAAAAT AC AC TG GAAC T C T G GG GCAAGACAT GT 
C TAT G GTAG CT GAGC CAAAC ACG T AGG AT TT C C GT T TTAAGGT T CAC AT G GAAAAGG T T AT AGC TT T G 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE a 



MS FL I DS S IM I T SQI LFFG FGWL FFMRQL FKDYE I RQYWQVI FS VT FAFSCTMFEL 1 1 FE I LGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRL'LFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMAI.LSGFGAWCPYTYMSYFLROTTDTDILALERRLLQ 
TMDM 1 1 SKEQKRMAMARRTMFQKGEVHNKPS GFWGMI KS VT T SASGS ENLT L I QQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVSSVLLIRMSMPLEYRTI ITEVLGELQFNFYHRWFDVI FLVSALSS ILFLYLAHKQAPEKQMAP 

Important features : 
Signal peptide: 



Uj amino acids 1-23 



Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobal ami n -binding proteins 

amino acids 151-160 
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FIGURE a 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAG'TGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAATTTCAGAG 
AAGCTAAGGAGGCCTC3TAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
AT TATCACCACCAAAGATCCCATAT TCAACACTCAAA.CT GCAACACAAACAA.CAGAAT TTAT T G T 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
p CTGCTCCAGCTTCCACTTCTATTCCACGGAGA?^AAAAATTGATTTGTGTCACAGAAGTTTTTATG 
Q GAAACTAGCACCATGTCTACAGAAACTGAACCATT TGTTGAAAATAAAGCAGCATT CAAGAATGA 

AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
fj: CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
|4j CAGCAGAAGGAAA.TGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
'^J TGAGGAATCAAAGAAAACTGATAAAAACCCAGAAGAGTC C AAGAGT CCAAGCAAAAC T ACCGT GC 

f|| GATGCCTGGAAGC TGAAGTTTAGATGAGACAGAAATGAGGAGACACACCTGAGGCTGGTT TCTT T 

f|j CAT GCT CC T TACCC TGCCCCAGC TGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAG T CCA 

CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 
* m GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
CI TTCTAGCCTGGCTATGTCCTAATT^ATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
y| CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
m AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
f£ GGCAGAAAAGT TTAGCCCCTGAAAGCCATGGAGAT TCTCATAACTTGAGACCTAATC TCT GTAAA 

CI GC TAAAAT AAA.GAAATAGAACAAGGCT GAGGATACGACAGT ACAC TG TCAGCAGGGAC TGTAAAC 

ffj ACAGACAGGGTCAAAGTGTT T TCTC TGAACACATTGAGT TGGAATCACTGTTTAGAACACACACA 

C T TAC T T T T TC TGG TCT CTACCAC TGC T GAT AT TT T CTC TAGGAAATAT ACT T T TACAAGTAACA 
AAAATAAAAAC TCT TAT AAAT T TC TAT T T T TAT C TGAG T TAC AGAAAT GAT T AC TAAGGAAGAT T 
ACTCAGTAAT T TGT T TAAAAAG TAATAAAAT TCAACAAACAT T TGC TGAATAGC TAC TAT AT GT C 
AAGTGCTGTGCAAGGTATTACACTCTGTAATTGAATATTATTCCTCAAAAAATTGCACATAGTAG 
AAC GC T AT C TGGGAAGC TAT T T T T T TC AG T T T T GATAT T TC T AGC T TATC TAC T TC CAAACT AAT 
T T T TAT T T T TGCTGAGACTAAT CT TAT T CAT T T TC TC TAAT AT GGCAACCAT TATAAC CT TAAT T 
TAT TATTAACATACCTAAGAAGTACATT GTTACCTCTATATACCAAAGCACAT T TTAAAAGTGCC 
AT TAACAAATGTATCACTAGCCCTCCTT TTT CCAACAAGAAGGGACTGAGAGATGCAGAAATATT 
. TGT GACAAAAAAT TAAAGCAT T TAGAAAACT T 
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FIGURE 6 



MARC FS LVLLL TS I WT TRLLVQGSLRAEE LS I QVS CRIMGI T LVSKKANQQLNFTEAKEACRLLG 
L S LAGKDQVE TALKAS FE TCS YGWVGDG FWI S RI S PNPKCGKNGVGVL I WKVP VS RQFAAYC YN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKL I C VT E V FME T S TMS TE TE P FVENKAAFKNEAAGFGGVP T ALL VLALL F FGAAAGLG FC YVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

' f *[ amino acids 1-16 

Q 

ff\ Transmembrane domain: 

yj amino acids 235-254 

'HA 

'Jz N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

JI| Casein kinase II phosphorylation site. 

en 

amino acids 145-149, 214-218 

m 

■■=$? 

mi Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGCAT CAT C AGAAG T GAAC C T GGCAAAC T T AC CT CCCAGC TAT C ACAAT GAGACC AACACAGAC 
ACGAAGG T T GGAAATAAT ACCATCC AT GT GCAC CGAGAAAT T C ACAAGAT AACCAACAAC CAGAC 
TGGACAAATGG TC T T T T CAGAGACAGT TAT CACAT C TGT GGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATT TAGA TCTGGACCAGGCTGTGGGTAGATGTGCAA 
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
T T T G T T CT ACATGGC TT T GATAAT T GT T T GAGGGGAGGAGATGGAAACAATGT GGAG T CT C CC T C 
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGC TAGT GAAGACCAGAGCAG T T TCATC T GGT TGT GACT CT AAGC TCAGTGCT C T 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
T T T TCT T GAGGCAT GCACAT C T GGAAT T AAGGTCAAAC T AAT T CTCACAT CC C T C TAAAAG TAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
T AC AGG T T AAC CT G C AGAAACAG T AC T TAG G T AAT T G TAG G G C GAG GAT T AT AAAT GAAAT T T G C 
AAAAT C AC T TAGCAGCAAC T GAAGAC AAT TAT CAAC CAC G T GGAGAAAAT CAAACC GAGC AGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAA.TGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGATTGTATAAGCATGCTTTCTTTGAGTTTTAAATTATGTATAAACATAAGTTGCATTTAGAA 
AT C AAG CAT AAAT CAC T T CAAC T GCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEWIANLPPSYHNETNTDTKVGNNTIHVHREIHKITNNQTGQMVFSE 
TVITSVGDEEGRRSHECI IDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEY.EVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 



1 * Signal sequence: 

CI 

Zl amino acids 1-19 



t|| N-glycosylation site. 

'j\ amino acids 96-100, 106-110, 121-125, 204-208 



Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 



^! N-myristoylation site. 

f|l amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE Q 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGGCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAG TAT AT TGACCACAAGC TCAGAGGAGT GCAGAACCC TGT AGCCCGCTGCAT CATG TGCT GT T T 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGAT TC T GGGC AAGAAG AACGAGGC GCCCCC GGACAACAAG AAGAGGAAGAAGTGAC AGCT C C GG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
T TGAGAGGCT GAGGC GGGCGGATCACC T GAG TCAGGAGT TCGAGACCAGCCT GGCCAACAT GGT G 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GC T AC TCGGGAGGC T GAGGC AGGAGAAT CGC T TGAACCCGGG AGGC AGAGG T TGCAG T GAGC C GA 
GATCGCGCCAC TGCAC T C CAAC C T GGGT GACAGAC T CT GT CT CCAAAACAAAACAAACAAACAAA 
AAGAT T T TAT TAAAGATAT T T TGT TAAC T C 
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FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQD I PT FPL I S AFI RTLRYHTGSLAFGAL I LTLVQ I ARV I L 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
I VRWVLDKVTDLLL FFGKLLWGGVGVLS FFFFS GRI PGLGKDFKS PHLNYYWL PIMTSI LGAY 
VI AS GFFSVFGMCVDTL FLC FLEDLERNNGS LDRP Y YMSKS LLKI LGKKNEAP PDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 {type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 
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GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTT GTGGCAGGC CCC GAGGGCTATGAGACCCAGT GGTGGGAT GCCCCGAGCATT G 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NG FWFFK FL I LVGLT VGAFY I PDGS FTN I WFY FG WGS FL FI L I QLVLL I DFAHS WNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPPCV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
S FFHFCLVLASLHVMMTLTNWYKPGETRKMI S TWTAVWVKICASWAGLLLYLWTLVAPLLLRNRD 



rf| Signal sequence: 

III amino acids 1-20 

\ li 

Transmembrane domains: 

r II 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
□ 272-283, 324-340, 391-406, 428-444 

U! 
Q 

Q 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT TGGAAAAAGAC TCCTGTAACC C TCC T CCAGGATGAAC CAC C TGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCATTGAGAACACATTAGAGAAGGAGGT GATGCAGTATGACTACTAT TCTTCATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACTTT TAGAACT ATGA GTACTACTTT TGT TAAATGTGAAAAACCC TCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CAC TGCTGGCTT TAT T GAAC AG C T AAT AAAG AT T TAT T TAT T G T AAT AC C T C AC AAAC G T T G T AC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGAC T T AATATAT T GAAG T AACAC T T T T T TAG T AAGC AAGAT AC C T T T T TAT T T CAAT T CAC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CT TGCAAAAGGGGAAGAAAGGAAT TGCGAATACATGTAAAATGTCACCAGACATT TGTATTAT TT 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA 
CTAAAT TAAAAA 
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FIGURE 14 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWIIELNVNGGIENTLEKEVMQYDYYSSYFDIFLLAVFRFKVLILAYAVCRLRHWWAIALTT 
AVTSAFLLAKVILSKLFSQGAFGYVLPI ISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCGCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATC TAA GCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCAT'CTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGC TGT GTGAC T C TAGT CT TGGCCCCAGGAAGCGAAGGAACAAAAGAAACT GGAAAGGAAGATGC 
T TTAGGAACATGT TT TGCT T T TT TAAAATAT ATATATT TATAAGAGAT CCT TTCCCATT TATTCT 
GGGAAGATGT TTT TCAAACTCAGAGACAAGGACT TTGGTTTT TGTAAGACAAACGATGATAT GAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 



APP_ID=1 0063728 



Page 169 of 322 



FIGURE 16 

MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLIALEPGILDTANfVE 
ALRLAGLGLQQLDEGL FSRLRNLHDLDVSDNQLERVPPV I RGLRGL TRLRLAGNTR I AQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
LLAAIAAVGAAYCWRGRAblAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-rnyristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
AT GAAGAAGGCAGCCAGGAT GAAT C CT TAGAT TCCAAGACTAC T T T GACATCAGATGAG T CAG TA 
AAGGAC CAT AC T AC T G C AGGCAGAG TAG T TGC T GG TCAAAT AT T TC T T GAT T C AGAAGAATC TGA 
ATTAGAATCCTCTATT CAAGAAGAGGAAGACAGCC TCAAGAGCCAAGAGGGGGAAAGTGT CACAG 
AAGAT AT CAG C T T T C T AGAG T C T C C AAAT C C AGAAAACAAGGACT AT GAAGAGCC AAAGAAAG T A 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTG 
CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT 
AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAG 
CAATAAGAAAAGC CAAAAAAGAGAAGCAT ATC GGT AT C T CCAAAAGGCAGCAAGCAT GAACCATA 
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 
GCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGCTCTTGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCT TGGGGGCAATC TAATAGCCCACATGGTT T TGGTAAGTAGACT T TAG TGGAAGGCT 
AATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTTTCAGCTTTCATGATC 
CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 
AACACAT GGAATCTACATGTAAAT GAAAGT TGGT GGAGT CCACAAT T T T T CT T TAAAATGAT TAG 
T T T GGC T GAT TGCC CC T AAAAAGAGAGAT C TGAT AAAT GGCTCTTTT TAAAT T T TC TC TGAG T TG 
GAAT T G T CAG AAT CAT T T T T T AC AT TAGAT T AT C ATAAT T T T AAAAAT T T T T C T T TAG T T T T T C A 
AAATTTTGTAAATGGTGGCTATAGAAAAACAACATGAAATATTATACAATATTTTGCAACAATGC 
CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT T AAC T CAT T T T T AAT AAAAT TAT G T C T AAGAT T AAAAAAAAAAAAAAAAAAAAAAAAAAA. 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKILNGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S S QAKALVY YT FGAL GGNL I AHMVLVS RL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites . 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C- terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen -binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE iq 



AAT TCAGAT T T TAAGC CC AT T C T GCAGT GGAAT T T CAT GAACTAGCAAGAGGACACCATCT T C T T 
GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCTTTTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACAT T T T TAT CAC T GGAT GT GAC TCGGGCTTTG GAAAC T TGGC AGCCAGAAC T T T T GAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACC T CAGAGAGAC TT C G TAC TG T GC T TC T GGAT G TGAC CGAC CCAGAGAAT GTCAAGAGGACT G 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGC GT GC TGGCT CC CACTGAC TGGC T GACAC TAG AG GAC TAC AGAGAACC TAT TGAAGT G AA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
ATGCATTGAACCAGGATTGTTCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAA7VAAAAC 
TCGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA 
AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GT AT T T AGGC TTTGCCTGCTT GG T GT G AT G T AAGGGAAAT T GAAAGAC T T GCCCAT TCAAAAT GA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAART FDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENVKRTAQWKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
RE P IEWLFGL I SVTLNMLPLVKKAQGRVINVS S VGGRLAI VGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCI E PGL FKTNLADP VKVIEKKLAI WEQLS PD I KQQYGE GY I EKS LDKLKGNKS YVM4D 
LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAEIJ^PKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 



%l amino acids 13 6-152 



N-glycosylation sites. 



amino acids 161-163, 187-190 and 253-256 



r'f Glycosaminoglycan attachment site. 

L|| amino acids 39-42 



N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGC ATG GAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GG TGAAGCCAAGAACAGCAT TAC T GATT CCCAAATGGAT GAT GT T GAAGT TGT T TATACAATT GA 
CAT TCAGAAATATATTCCATGCTATCAGCT TT T TAGCTT TTATAAT TC TT CAGGCGAAGTAAATG 
AGCAAGCACT GAAGAAAATAT TAT CAAATGT CAAAAAGAAT G TGGTAGGT TGGTACAAATTCC GT 
CGT CAT TCAGAT CAGAT CAT GACGT T TAGAGAGAGGCTG CT T CACAAAAACT TGCAGGAGCATT T 
T T CAAAC CAAGACC TTGTTTTTCT GC TAT T AAC AC C AAG T AT AATAACAGAAAGC T G CT CT AC T C 
ATCGACTGGAACAT TCCTTATATAAACC TCAAAAAGGACTTT TTCACAGGGTACCT TTAGTGGT T 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
T T T TAGC CGAGCAGTACAAACACACAGC TC TAAAT TT T T TGAAG AAGAT GGATCCT TAAAGGAGG 
TACATAA.GATAAATGAAATG TATGCT T CAT TACAAGAGGAAT TAAAGAGT AT ATGCA?\AAAAG TG 
GAAGACAGT GAACAAGCAGT AGATAAAC TAGTAAAGGAT GT AAACAGAT T AAAACGAGAAATTGA 
GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTAAAAAA.TAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AGACAAT CT GACC T TAAT GG TAGAACACACT GACAT T CC TGAAGC TAG T C CAGC TAGTACACCAC 
AAAT CAT T AAGCATAAAGCC T TAGAC T T AGATGAC AG AT GGCAAT T CAAGAGAT CT C GG T T GTTA 
GATACACAAGACAAACGAT C TAAAGCAAATACTGGTAGTAG TAAC CAAGATAAAGCATC CAAA&T 
GAGCAGCCCAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTC 
C TACAT TT TGATCCTTT TAACCT TACAAGGAGATT TTT T T ATT TGGC TGAT GGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTT TGCAGT AATACACAGATAAC T CT TAG T GCAT T TAC T T CACAAAGT ACT T TT TCAAACAT CA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACAT TC T T TAGAAT T GGAAAAG T GAGACCAGGCACAGTGGCTCACACCTG TAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGATTGAT TGAAGCTAGGAGT TAGAGACCAGCCTGGGCAACGTAT T 
GAGACCAT GT CTAT TAAAAAATAAAAT GGAAAAGCAAGAATAGCC T TAT T TT CAAAATAT GGAAA 
GAAAT T T ATATGAAAAT T TATCT GAGT CAT TAAAAT TC T CCT TAAGTGAT AC T T TT TTAGAAGTA 
CATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCAATAAATTTGCAAAACATCATCT 
AAAATTTA?\AAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 

MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGETOEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHS SKFFEEDGSLKEVHKINEMYASLQEELKS I CKKVEDSEQAVDKLVKDVNRLKRE IEKRRGA 
Q I QAAREKNI QKDPQEN I FLCQALRT FFPNSE FLHS CVMS LKNRHVSKS S CN YNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 



Important features: 
Signal peptide: 

ffl amino acids 1-19 



\\\ 

m 



U! 



N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 



Growth factor and cytokines receptors family. 

?%\ amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGAT GCC CC CCT CCACGAGATCAACGGT GAT CATTT GAAGAT CT GT C CCCAGGGTT CTACCT GCT GCTCT 
CAAGAGATGGAGGAGAAGTACAG CCT GCAAAGTAAAGAT GATTT CAAAAGTGT G GTCAG CGAACAGT GCAATCAT TT G 
CAAGCTGTCTTTGCTTCACGTTACAAGAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 
AATGATATGTTTGTGAAGACATATGGCCATTTATACATGCAAAATTCTGAGCTATTTAAAGATCTCTTCGTAGAGTTG 
AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 
TTCCGCCTGGTGAACTCCCAGTACCACTTTACAGATGAGTATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
ATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAGAGGC 
T GTT T GG C CAAC CAAGG GGAT CT CGATT TT GAAT GGAACAATT T CAT AGAT GCT AT GCT GAT GGT GG CAGAGAGGCTA 
GAGGGTCCTTTCAACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 
AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
T CCAT CT CT GAAAGT GC CTT CAGT GCT CGCTT CAGACCACAT CACCC C GAGGAAC GCC CAAC CACAGCAGCT GGCACT 
AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 
GT TT GCAACGAT GAGAGGAT GGCT GCAGGAAACGGCAAT GAGGAT GACT GTT GGAAT GGGAAAGGCAAAAGCAGGTAC 
CT GT TTG CAGT GACAGGAAATGGATT AGC CAACCAGGGCAACAACC CAGAGGTCCAGGT T GACAC CAGCAAACCAGAC 
AT AC T GAT CCTT CGT CAAATCATGGCT CTT CGAGT GAT GACCAGCAAGAT GAAGAATGCATACAAT G GGAAC GACGT G 
GACT T CTTT GAT AT CAGT GAT GAAAGTAGT GGAGAAGGAAGT GGAAGTGGCT GT GAGTAT CAGCAGT GC C CTTCAGAG 
TT TGACT ACAAT GCCACT GACCAT GCT GGGAAGAGT GCCAAT GAGAAAGC CGACAGT GC T GGT GT CC GT C CT GGGGCA 
CAGGC CT AC CTC CT CACT GT CTT CT GCAT CTT GTT CCT GGTTAT GCAGAGAGAGT GGAG ATAA T T CT CAAACTCT GAG 
AAAAAGT GTT CAT CAAAAAGTTAAAAGGCACCAGTT AT CACTTTT CTACCAT CC TAGT GACTT T GCTT T TT AAAT GAA 
T GGACAACAATGTACAGTT TTTACTAT GT GGC CACT GGTT TAAGAAGT GCTGACT TT GT TTT CT CATT CAGTTTT GG G 
AGGAAAAGGGACT GT GCAT T GAGTT GGT TCCTGCTCCCC CAAACCAT GTTAAAC GTGG C TAACAGTGTAGGT ACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CT GAAAT ATTAAATAGCT GTACAGAAGCAGGTTTTATTTAT CAT GT TAT CT TAT TAAAAGAAAAAGC CCAAAAAGC 
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FIGURE 2d 

MARFGLPALLCTIAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 

CCSQEMEEKYSLQSKDDFKSWSSQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 

LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLWSQYHFTDEYLECVSK^ 

QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSVVNPTAQCTHALLKMIYCSHCRGL 

VTVKPCYNYCSNIMRGCLANQGDLDFEWlSnSfFIDAMLMVAERLEGPFNIESVM 

MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 

EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGIANQGNNPEVQVDTS 

KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 

KSANEKADS AGVRPGAQAYLLT VFC I L FLVMQREWR 



Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) 

amino acids 515-524 



0| N-glycosylation site. 



amino acids 514-518 

Glycos ami noglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 
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CT CGC C C T CAAAT GGGAAC GC TGGCC T GGGAC TAAAGCATAGAC CACCAGGCT GAGTAT CC TGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCG ATGA AAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
T CAAGG G C AAT G T GAAGAAAAC AAGAC AC C AAAGGC AC CACAGAAAGC C AAAC AAGC AT T CCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTG TAG GAGCTCTG 
AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTGTCATTTAACCTTAAATGCAATCAGGAAAGTAGCAAACAGAAGTCAATAAATATTTTT 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 

MKVLISSLLLLLPLMIMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important, features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCC ATGG ACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGGTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAAC T T TCAGT TC TACCCACCGGGCTGCAGGGTCACCT GCCTAGACCCAAAT C CC CACT T T GAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCC CACC T GGAAACACAT TGGGGATGGCT GC T GCCT CACCAGAGAGACCT GGAAGGATCTT GA 
%\ GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
f|| GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
fill T TCCC CAGC C TCCAAT T AGAACAAGCCACCCACCAGCC TAT C TAT C T T CCACT GAGAGGGACC TA 

^ GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
AT GCCAGAGCAAGACT CAAAGAGGCAGAGGT TTTGTTCT CAAATAT T T T T TAAT AAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGECVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFiyMQQVFEPT^ 
KH I GDGCC L TRE TWKDLENAQFS E I QMERQPPPLKWL PVGPH IMGKAVKQS FPS S KAL I CS FP S L 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTT'CTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGTT T ATGAT TC T TATAC T AAT T TAT ACAAAGATAT T AAGGCC C T GT TCAT TAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AG T T AAAAG C T GAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 3Q 
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MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE ai 



GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCXCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
AT T TGC ATCT GT T T T GATAAAT GAT GT T GACAC C C T CCACCGAAT T C T AAGT GGAAT CATGT CGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGT CTGCGGT GT TT TAT GG T GGC TG TAT TAT GAC TAT ACCAACGAC C T CAGCATAGAAT T GGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
AAATAAAGCGATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
fH ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
y| AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
S\ TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT TCC GAGAATCAT TGT CATGTACAT GCAAAACGCAC T GAAAGAACAGCAGCATGGTGCAT TG T 
f*S CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
HI AACC AGAATGCAT ATAC TACAACT GC TAT TAAT GGGAC AGAT T TC T GTACAT CAGC AAAAGAT GC 

AT T CAAAATC T TGTC CAAGAACT CAAG T CAC T T TACAT C TAT TAAC TGCT TT GGAGACT T CAT AA 



fiJ 
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TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAAC TCCAGGCCAT TGTGAGATAGATAC CCAT T T AGGTATCT GTACCT GGAAAACAT T 
TCC TTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGTTAGTGAAT TTTTT T 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 32 



MSGRDTILGLCIIAIALSIJ^FTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE 
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
ILIFFWLWAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWT C Y FNR SKNDPPDHPILSSLSILF FYH QG TWKG S FL I S WR I PR 1 1 VM YMQNALKE QQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHS FL S V FE T VL DAL FL C FA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 



m 



m 
pi 



ill 



Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains : 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 



\n N-glycosylation sites. 

Q amino acids 204-208, 295-299, 313-317 



N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 



APP ID=1 0063728 



Page 186 of 322 



FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAAT t T T T T GAAAAAAAAATT GCCT T CT T CAAACAAGGGT GTCAT T CT GATATT T ATGA GGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT TCAAACAAAGAAACGGCAAAGAAGAT TAAAAGGCCCAAGT TCAC TG T GCC T CAGAT CAAC T GC 
GAT GTCAAAGC CGGAAAGAT CATC GAT CCT GAGT T CAT T GT GAAATGT C CAGCAGGATGC CAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGT GG TGT GCT T GATAAT TCAGGAGGGAAAATACT T G T TCGGAAGGT T GCT GGACAGT C T 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TAT CGT CTTAGAAAGTAAACCCAAAAAGGGT GTAACCTACC CATCAGC T C TTACATAC T CAT CAT 
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
GCCACAGGAG CCAGGAGATGGAT C T CT GGT CCAC T GCCACCTACACAAGCAGCCAAAACAGGCCC 
AGAGCT GAT C CAGGT AT CC AAAGGCAAGAT CC TT CAGGAGC T GCC T TCCAGAAACC T GT T GGAGC 
GGATGT CAGCCTGGGACTTGT TCCAAAAGAAGAAT TGAGCACACAGTCTT TGGAGCCAGTATCCC 
TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 
CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
C GG T C C AC T GAT GGGTGTTGTC C AG TAT G G AGAC AAC C C T G C T AC T CAC T T T AAC C T CAAG AC AC 
ACACGAATTCTCGAGATCTGAAGACAGCCATAGAGAAAATTACTCAGAGAGGAGGACTTTCTAAT 
GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
GACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGAAAATGAG 
AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
TGGAGT T TGGGT T CGACAAGTACAGCAGCAAGCC T GACATC CTCAACGCCAT CAAGAGGG T GGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGC TAGAAGTCATTGCCACTCACCCCGCCAGAGACCACT CCT TCTT TGTGGACGA 
GT T TGACAAC C TC CAT CAGTATG TCCCCAGGATCATCCAGAACAT T TGT ACAGAGT TCAAC TCAC 
AGCC TC GGAAC TGAA T T CAGAGCAGGCAGAGCACCAGCAAG TGCTGCT T TAC TAAC TGACGTGT T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
TGATCACAAACGT AT AGAAT GAGCCAAAAGGC TACAT CATG T T GAGGGT GCT GGAGAT T T T ACAT 
T T T GACAAT T GT T T T C AAAAT AAAT GT T C GGAATACAGT GC AGC CC T TAC GAC AGGC T T ACGT AG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T T GT CAT GACAAT G T AGGAAT T GCT GAAT TAAATGT T T AGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAA^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 3d 



MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
T PTTLPRPS PSAAS TTS I PRPQS VGHRS QEMDLWS TAT YT S SQNRPRADPG I QRQDP S GAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPIWAAmTOGWPTDKVEEASRLARESGINIFFITIEGAAENEKQYVVEPNFANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMIL ITDGRS YDDVRI PAMAAHLKGVI T YAI GVAWAAQEELEVIATHPARDHSFF 
VDE FDNLHQYVPRI I QNI CTE FNS QPRN 



\t Important features: 

fll Signal peptide: 

amino acids 1-26 



Transmembrane domain: 



f-'i amino acids 181-200 



N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site, 

amino acids 304-308 
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FIGURE 25 



C C GAGC AC AGGAGATT GCCT GC GTTT AGGAGG TG GCTGCGTT GT GGGAAAAGCT AT CAAGG AAG AAAT TGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGC'TGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
CAGAC AAG ACTT T C ACTTCACACTT CGAGAGC AT T C AAAC TGCT CT CATC AAAATC CATTTC TGGT CAT T C 
M : TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
w TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC 

rl 

AT T GTC CT T AGAGG AT GAAC AC CTTCTT T ATGGT GAC AT AATCCGACAAGAT T T TT T AG AC ACAT ATAAT A 
'?[ \ ACCTGACC T TGAAAAC CATT AT GGC ATT CAGGTGGGT AAC TG AGTT TTGCCC C AAT GC C AAG TACGT AAT G 

~C\ AAGACAGACACTGATGTTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGA 
GAAGT T T T T C AC AGGT TATCCT CTAATT GAT AAT T ATTCCT ATAGAGG AT TT T ACC AAAAAACCC AT AT T T 
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT 
CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC 
ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 
TGGC AGGT CAT G CT AAGG AACACC AC AT GC CAT T AT TAAC TTCACATTC T AC AAAAAG CCT AG AAGGAC AG 
GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 
ACACTGAACTGAAACTCATGAAAAACCCAGACTGGAGACTGGAGGGTTACACTTGTGATTTATTAGTCAGG 
CCCTT C AAAGAT GAT ATGTGGAGGAATT AAAT AT AAAGGAAT T GGAGGT T TT TGCT AAAG AAAT T AAT AG G 
AC C AAACAATTT GG AC AT GTCATTCT GT AGACT AGAAT TT CTT AAAAGGGTGTT ACTG AGTT AT AAGCT C A 
CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTGAACAATGTAGTCACTTGAAGGTTTTGTGTA 
TAT CT T AT GT GG AT TACC AATT T AAAAATAT ATGT AGT TC TGTGTC AAAAAACTTCTT C ACT GAAGT TATA 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
ATTATTTAAAATTACTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAG 
TGAATC AT T CTTT AC ATGCAAACATT T T C C AGTT ACTT AACT GAT C AGTT TAT TAT TG AT AC AT C ACT CCA 
TT AATGT AAAGT C AT AGGTC AT T ATT GC AT AT CAGT AAT C TCTT GGACTT TGT TAAAT ATT T TACTGT GGT 
AATAT AGAG AAG AAT T AAAGCAAGAAAAT C TG AAAA 
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FIGURE 36 



MASALWTVLPSBMSLRSLKWSLLLLSLLSFFVMWYLSLPHYWIKRVNWMYFYEYEPIYRQDFHF 
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDPCMLA 
LSLEDEHLLYGDI IRQDFLDTYNNLTLKTIMAFRWTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVYYGICLNLLKWIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features : 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 12-16, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAGG AIGG AGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCT T GCC AAGAGAGTACAC AG TC AT TAATGAAGCC TGCCC TGGAGCAGAGTGGAATATCAT GT GTCGGGAGT GCT GT G 
AAT ATGAT C AGAT TGAGTGC GTC TGCCCC GGAAAGAGGGAAGTCGTGGGTTATACCATCCCT TGC TGC AGGAATGAGGAGAA 
T GAGTGTGAC T C C TGC C TGATCC ACCC AGGT T GTAC C AT CT T TGAAAAC TGCAAGAGCTGCCGAAATGGC T CATGGGGGGGT 
ACC T TGGATGACT TC T ATGT GAAGGGGT TCT AC TGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGAC TGCATGC GATGT G 
GCC AGGT TC TGCGAGC C CC AAAGGGTC AGAT T TTGT T GGAAAGC TATCCCC TAAATG CTCAC TGTGAATGGAC CAT T CATGC 
TAAACC TGGGT TTGT C ATCC AAC TAAGAT TT GTCAT GT TGAGT C TGGAGTTT GAC TACATGTGCC AGTATGAC TATGTTGAG 
GTTCGTGATGGAGAC AACCGCGAT GGCCAGAT CATC AAGCGTGT C TGTGGCAACGAGCGGCCAGC TC CT ATC C AGAGC AT AG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TATACTGGGCAGCGCTGTGAAAATCTCCTTGAAGAAAGAAACTGCTCAGACCCTGGGGGCCCAGTCAATGGGTACCAGAAAA 
TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
T C TT AGTGGCAATGAGAAAAGAAC T TGCC AGCAGAAT GGAGAGTGGT CAGGGAAAC AGCCCAT C T GCATAAAAGC C T GCC GA 
GAAC CAAAGAT TTCAGACCTGGTGAGAAGGAGAGT T C TTC C GATGCAGGT T CAGTCAAGGGAGACAC CAT TACACC AGC TAT 
'f? AC TC AGC GGCC T TCAGC AAGCAGAAAC TGCAGAGT GC CC C TACC AAGAAGC CAGCCC TTCCC T T TGGAGAT C TGCC CATGG G 

f *l ATAC CAACATC TGCAT ACCCAGC TC CAGTATGAGT GC AT CTCACCCTTC TAC C GC C GCC TGGGC AGC AGCAGGAGGAC AT GT 

■I ** C TGAGGAC TGGGAAGTGGAGT GGGCGGGC ACC ATC C TGCAT CCC T ATC TGC GGGAAAAT TGAGAACATCAC T GCTCC AAAGA 

Ul CCCAAGGGTTGCGCTGGCCGTGGCAGGCAGCCATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGC 
s GTGGTTCC TAGTCTGC AGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 

k s I GTCACCATGAT C AAGAC AGC AGACC TGAAAGT TGT T T TGGGGAAATTC TACCGGGATGATGACC GGGATGAGAAGACCATC C 

If! AGAGCCT ACAGAT TTC TGC TAT CAT TC TGCAT CCC AAC TAT GACCCC ATCC TGCT T GATGC TGAC ATCGCC ATCCT GAAGC T 

P| CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 
f!| TCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTTCAAGAACGACACACTGCGCTCTGGGG 

f' # 1 TGGT C AGTGTGGTGGAC TCGC TGC TGT GTGAGGAGC AGC ATGAGGACCATGGC AT C C CAGTGAGTGT CACTGATAAC ATGT T 

.*•?•;?? 

?\\ CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 

I 97 

CGAGC ATCTCC T GAGC CACGC TGGCAT C T GAT GGGAC TGGT CAGC TGGAGC TATGAT AAAACAT GCAGCCAC AGGC TC TC C A 

CTGC C T TCACC AAGGT GC TGCCT TT TAAAGAC TGGAT TGAAAGAAATATGAAATGRACC ATGC T C ATGCAC TCCT TGAGAAG 

TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 

GTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 

CTACTAGGACAGCCAATTGGAAGATGCCAGGGCTTGCAAGAAGTAAGTTTCTTCAAAGAAGACCATATACAAAACCTCTCCA 

CTCCACTGACCTGGTGGTCTTCCCCAACTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 

GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 

TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 

GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
I PCCRNEENECDSCL I HPGCT I FENCKSCRNGS WGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQI LLES YPLNAHCEWT I HAKPG FVI QLRFVMLS LEFDYMCQYDYVEVRDGDNRDGQI I 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQP I C I KACRE PKI S DLVRRRVLPMQVQSRE TPLHQLYSAAFS KQKLQS APTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTWVAAHCVTDLGKVTMIKTADL 
KVVLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLDADIAILKLLDKARISTRVQPICIAASR 
DLS TSFQESHI TVAGWNVLADVRS PGFKNDTLRSGWSWDSLLCEEQHEDHGI PVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVS FPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature. 

amino acids 260-272 
N-glycosylation sites/ 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 
474-480, 491-497, 638-644, 666-672 
Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



n 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTCCAGACGGCTGTGCGAGCC 
TCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 
ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTA 
GGGCACGGCCCTTTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTTCGAA 
GGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAA 

T CTT T CCAAGGTTGTAC CAC CT GATT C CAGAT GGT GAAATTACCAGCATCAAGAT CAATCGAGTAGAT C C CAGTGAAA 
GCCTCTCTATTAGGCTGGTGGGAGGTAGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGG 
TGATCGCCAGAGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATGTCCCTC 
ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
GCAGGAACAATGGAC^GGCCCCGGATGCCTACAGACCCCGAGATGACAGCTTTCATGTGATTCTCAACAAAAGTAGCC 
CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
GTGTGGCATATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCGATATGGCA 
GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
pr- GGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAACAGCAATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAACA 
J*p . CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 
■%\ GCATGACCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAGCCCGGAGGAG 
TC^TAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGATGGGGTCGAACTGACAGAGGTCAGCC 
% GGAGTGAGGCAGTGGC^TTATTGAAAAGAACATCATCCTCGATAGTACTCAAAGCTTTGGAAGTCAAAGAGTAT^ 
™\ CCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT 
GGGTCATGTGGCTGGAATTACCACGGTGCTTGTATAACTGTAAAGATATTGTATTACGAAGAAACACAGCTGGAAGTC 
TGGGCTTCTGCATTGTAGGAGGTTATGAAGAATACAATGGAAACA^ 

CACCAGCATACAATGATGGAAGAATTAGATGTGGTGATATTCTTCTTGCTGTCAATGGTAGAAGTACATCAGGAATGA 
TACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTT 
TT TTATAGAATCAATGATGGGT CAGAG GAAAACAGAAAAAT CACAAATAGG CTAAGAAG TT GAAACACT ATATTTATC 
T T GT CAGTT TTT ATATTTAAAGAAAGAAT ACATT GT AAAAATGT CAGGAAAAGTAT GAT CAT CTAAT GAAAGCCAGTT 
ACACCTCAGAAAATATGATTCCAAAAAAATTAAAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCTAC 
AACATT GT T TATAT TTTTT CTATT CAAT AAAAAGCC CTAAAACAACTAAAATGATTGAT TT GTATAC CC CACTGAATT 
CAAGCT GATTTAAATT TAAAATT T GGTATAT GCTGAAGT CT GCCAAGGGTACATTATGGCCATTTTTAATTTACAGCT 
AAAAT ATTT TTTAAAAT GCATT G C TGAGAAAC GTT GCT TTCAT CAAACAAGAATAAAT ATT TT T CAGAAGT TAAA 



ill 
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FIGURE do 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PE VSAAAT I SLMTDEPGLDNPAYVS SAEDGQPAI S PVDSGRSNRTRARP FERST IRSRS FKKINR 
ALS VLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLI PDGE ITS I KINRVDPSESLS IRLV 
GGSETPLVHI IIQHIYRDGVIARDGRLLPGDI ILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDS FHVILNKSSPEEQLGI KLVRKVDEPGVFI FNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
^ SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
Q RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 
01 

yi Important features: 

h. i 

pi\ Signal peptide: 

III 

an amino acids 1-15 

■s 

CJ| N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Wl 

. ?*% Tyrosine kinase phosphorylation sites. 

fl\ amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE ai 

ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTT GC CTGCAGTGAAGCAGAGAGATAGATATTATT CACGT AATAAAAAACATGGGCT TCAACCT GACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCATTT T GGGGAAGGGAAAAACT CTGAC TAAT GAAGCATC CACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTT GGAAGAGGT ACAGGCAGAAAAT CCCAAAGT GT C CAGAGGC CGGTATCGCCCTCAGGAATGTAAAG 
fa CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
0 ' CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
•Q TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
W] ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
If j CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
f|J GGGT GTTAC TGC C CTAAGCAGAGAGCAGT TTT TCAAGGTGAATGGATTC TCTAACAACTACTGGGGAT 

f|| GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
» CCTGAAGTGGGTAAATATACAAT GGTCTTCCACACTAGAGACAAAGGCAAT GAGGTGAACGCAGAACG 

yt GATGAAGCT CT TACACCAAGT GT CACGAGTCT GGAGAACAGATGGGT TGAGTAGTTGTTCT TATAAAT 

M] TAGTATCTGTGGAACACAATCCTTTATATATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
Si ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
Pfl TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGAT 
CATGAGGGTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
AAT GAACGCTAT TTGAGGACT CT GGTTGAAGGAGATT TAT TTAAATTTGAAGT AATATAT TAT GGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGT ACGAAGATACAATACTGTTATTCAT TTATCC T GTACAATCATCT GT GAAGT GGTGGT GTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
TTTTAAAGAGTTTTTGTAAAATGATTTTGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACAT 
ATTAACTAATAATAAATAT GT CTAT CAAATAC CTCTGTAGTAAAATGT GAAAAAGCAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMMTFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKH3LMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKWGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYT1WFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 4.3 



GC T C AAGACCCAGCAGT GGGACAGCCAGACAGACGGCACG ATG GCAC T GAG C T CC CAGAT CT G GG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

AC GGGACAACT TGCAGAGCT GCAACCCCAGGACAGAGC TGGAGCCAGGGCCAGCTGGATG CC CAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAZ\AAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



m 

%\ 
rlf 

m 
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FIGURE AA 

MMiSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
C I FCCGCCHRSKCGMCCKT 

Important features : 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 



APP_ID=1 0063728 



Page 198 



FIGURE ax 



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAAT ATGG CTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCT TGTCACCATACAGCCAGAAGGGGGCAC TATCATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGT AGAC T TCC CAGATGGAGGCTACT CCC TGAAGCT CAGCAAACTGAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATT 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GT T TCTGAAGAGAGAGAGACAAGAAGAGT ACAT TGAAGAGAAGAAGAGAGT GGACAT TT GT CGGG 
AAAC TCCT AACATATGCCCCCAT T CT GGAGAGAACACAGAGTACGACACAAT CCC TCACAC TAAT 
AG AAC AAT C C T AAAGG AAG AT C C AG C AAAT AC GG T T T AC T C CAC T G T G G AAAT AC C GAAAAAG AT 
GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
TC TAGACAGCAGTGCACTCCCC TAAGT C T CTGCTCA 
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FIGURE 46 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVT FPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFS S P I LARKLCEGAADDPDSSMVLLCLLLVPLLLS L FVLGL FLWFLKRERQEE Y I E 
EKKRVD I CRET PNI C PHS GENTE YDT I PHTNRT I LKED PANT VYS T VE I PKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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ID= 



10063728 



Page 200 



FIGURE ah 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAKTGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAAT T GT GAATT T TCAT TGAAAAACAT CAGTGACAT T CAT CCAGAATCC T TCAA 
^ s CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
r\ ACACCAT GGCGAG TGGCT GGAGAGCAT CT AG T T T C C ACT T CGAT TC TGAAGAAAACAAACATAGG 

Q CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
fl CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
f\ TGTAGTTTAATGGGAATAAAATGTAAGTATCAGTAGTTTGAAAAAAAAAAA 

s 

□ 

if! 

r! 

rl| 
P! 

ry 
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FIGURE 48 



MTCCEGWT SCNGFSLLVLLLLGWLNAI PL I VSLVEEDQ FS QNP I S CFEWWFPG I IGAGLMAI PA 
TTMS LT ARKRACCNNRT GMFL S S FFS V I TVI GAL YCML I S I QALLKGP LMCNS PSNSNANCE FS L 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 

,7 

I N-glycosylation sites. 

1 amino acids 80-84, 132-136, 148-152, 163-167 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 



TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CAT T TG CCAATAAAGACGAT CCC T TCTACT AT GAC TGGAAAAACC TGCAGCT GAGCGGACTGAT C 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
C TACT T GCTGAGCACAGGACTGGCC TC CAGGGAT GGCC TGAAGCC TAACACT GGCC CCCAGCACC 
p " TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
% TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
ff% 

hi ' 

.! „ i 

| 

r? : 
\ I? 

m 

?*% 
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FIGURE so 

MERVTIJU.LLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
SQKQHSPVPEKAIPLITPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 3 6-59 

N-myristoYlation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 



G TGGAC T CTG AGAAGCCCAGGCAGT TGAGGACAGGAGAGAGAAGGC TGCAGAC CCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
P TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
Q GTGCGTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
I" 1 ; CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
I!'* GAAACTCAGCAGGCAGCT T TGGAAT GAATCCT CAGGGAGC T CC C TGGGGT CAAGGAGGCAATGGA 

E>|f GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
fflf AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
J? ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 



m 



m CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
Q CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
rU CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACT C T GAGACGTC TCCTGGGAT GT T TAAC T T T GACAC T T T C T GGAA 
GAAT TT T AAAT CCAAGC TGGGT TT CAT CAAC T GGGAT GCC ATAAACAAG GACCAGAGAAGC T C T C 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *2 

MKFQGPLACLLLAXCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAA.GSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISJCEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

CI Signal peptide: 

amino acids 1-21 

y I 

N-glycosylation site. 

HI 

n! amino acids 265-269 



is Glycosaminoglycan attachment site. 

Ep amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

'■•7 ? 

^JJ Casein kinase II phosphorylation site. 

%% amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

m 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 2.70-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE S3 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 
CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 
CAGAAATGCATCTTCAGCTTTGACAGCCATTGTCAGGAGAGGCCCAGTGAATATATTGCCACCAT 
CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 
CAT TGTCAGATGAGGATATAAGAGCAGAGGCTGACACCT TCATGTTTGGAGGC CATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 



MSLLSLPWLGLRPVAMSPWLLLLLWGSWLLARILAWTYAFYNNCRRLQCFPQPPKRNWFWGHLG 

LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 

WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 

SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 

RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 

MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 

LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 

IAFIPFSAGPRNCIGQAFAMAEMKWLAfflLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPLN 
VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites, 

amino acids 112-116, 168-172 
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FIGURE ss 

ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAATTCATGGCCAGTTTTATGAAGCTTTGGAAGGCACTATGGACAGAAGCTGGTGGACAGTTTT 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
G TGAT T CGAACAT T TGAGGG T T AC T T T T G GAAGCAACAATACAT T CT CGAACC T GAATG T CAG TA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TC T GGAT GT T GTC CCAC TGAAT TC C CAT GAATACAAACC TAT TCAGCAACAGCAAAAAAAAAAAA 
^^^^ 
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FIGURE *6 

MGPVKQLKRMFE P TRL IAT IMVLLC FALT LCS AFWWHNKGLAL I FC I LQSLALTWYS LS FI P FAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein. 

^ amino acids 30-72 
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FIGURE S7 

C GG CT CGAGCT C GAGC C GAAT CGGCT CGAGGGGCAGT GGAGCACC CAGCAG GC CGCCAACATGCT CT GT CT GT GCCTG 
TAC GT GC C GGT CAT C GGG GAAGC C CAGACCGAGTT C CAGTACTTT GAGT C GAAGG GGCTC CCT GC CGAGCTGAAGT C C 
AT TTT CAAGCT CAGT GT CTTCAT C CCCT CCCAGGAAT T CT CCACCTAC CGC CAGT G GAAGCAGAAAATTGTACAAG C T 
GGAGATAAGGAC CTT GAT GGGCAGCTAGACTTT GAAGAAT T T GT C CATTAT CT CCAAGAT CAT GAGAAGAAGCTGAG G 
CT GGT GT TTAAGATTT T GGACAAAAAGAAT GAT GGAC GCATT GAC GCG CAGGAGAT CATG CAGT C C CT G C GGGACTT G 
G GAGT CAAGATAT CT GAACAGCAGGCAGAAAAAAT T CT CAAGAGCAT GGATAAAAAC GGCAC GAT GACCAT C GAC T GG 
AAC GAGT GGAGAGACTAC CAC CT CCT CCAC CCCGT GGAAAACAT CC CC GAGAT CAT C C TCTACTGGAAGCAT T CCAC G 
ATCTTTGATGTGGGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTGGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGTCCATGCCT CC C GCAG CAACAACATG GGCAT C GT TGGT GGCTT C ACT CAGAT GAT T CGAGAAGGAGGGGCCAGG 

TCACTCTGGCGGGGCAATGGCATCAACGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
AT C GC CCAGAGCAGCA.T CT ACC CAATGGAGGT CCT GAAGACCC GGAT GGC GCT GC GGAAGACAGGC CAGTACT CAGGA 
ATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTG 
GGCAT CATCC C CTATGCC GGCAT CGAC C TT GCAGT CT ACGAGACGCT CAAGAATGCCT G GCT GCAGCACTAT GCAGT G 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGC 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
GT C CT GCT GACC C CAGCAGACC CT C CTGTT GGT T CCAGC GAAGACCACAGGCAT T CCT TAGGGT C CAGGGTCAG CAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
ATAAT CCAT GAT GAAAGGT GAGGT CAC GT GGC CTC CCAGGCCT GACTT CCCAAC CTACAGCAT TGAC GC C AACTTGGC 
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGCCT GGAC C CT GT C AGGAT GGG C CCCAC CTCAGAACCAAACTCACT GT C CCCACTGT GGCAT GAGG GCAGT GGAGCA 
CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
ACT GT TGGGAAAAGGGTT TT GT CCAGAAG GACAAGCC GGACAAAT GAGCGACT T CTGT GCTT CCAGAGGAAGAC GAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
GGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT 
AACTATT TT T ATAGATT T GTTT AATTAATAGCTT GTCAT TTT CAAGT T CATT TT TTAT T CATATTTAT GT T CAT GGTT 
GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GT TGGGGAAGGGCT T GC CC C CAGC CTTAGGATT T CAGGGT TT GACT G GGG GCGTGGAGAGAGAGGGAGGAAC CT CAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
TGAGGAAATCTTCAATAGGATGCAAAGATCAATGCAAAAATTGTTATATATGAACATATAACTGGAGTCGTCAAAAAG 
CAAAT T AAGAAAGAAT T GGACGTTAGAAGT T GTCATTTAAAGCAG CCTT CTAATAAAGTT GTTT CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *8 

MLCLCL YVP V I GEAQTE FQ Y FE S KGL PAE LKS I FKL SVFIPSQEFST YRQ WKQK I VQAG DKDLDG 
QLDFEE FVHYLQDHEKKLRLVFK I LDKKNDGR I DAQE IMQS LRDLGVK I SEQQAEKI LKSMDKNG 
TMT I DWNEWRD YHLLHPVENI PE 1 1 LYWKHS T I FDVGENLT VPDE FT VEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVIidQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGIWLKIAPESAIK 
FMAYEQ I KRLVGS DQE TLR IHERL VAGS LAGAIAQS S I Y PME VLKTRMALRKT GQYS GMLDCARR 
ILAREGVAAFYKG WPNMLGI I PYAGI DLAVYETLKNAWLQHYAVNSADPGVFVLIACGTMSS TC 
GQLAS Y PLALVRTRMQAQAS I EGAPEVTMS S L FKH I LRTE GAFGL YRGLAPNFMKVI PAVS I S YV 
VYENLKITLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE so 

GGAAGGCAGC GGCAGCTCCACTCAGCCAGTAC CCAGATAC GC T GGGAAC C T T C C CCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CAC T CATC AT TGGC T T T GGT AT T T C AGGGAGACAC TCCAT CACAG T CAC TAC T G T CGCC T CAGCT 
GGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAAACTTTCTGATAT 
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
TCAACAACACATAC TC CT GT ATGAT T GAAAATGACAT T GCCAAAGCAACAGGGGATATCAAAGTG 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAA ATAA T 
GTGCCTTGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCAC 
CAC CAGAT AT GACC TAGT TT TATAT T TCT GGGAGGAAATGAAT TCATAT C TAGAAGT C TG GAGT G 
AG C AAAC AAGAGCAAGAAAC AAAAAGAAG C C AAAAGCAGAAG GC T C C AAT AT GAACAAGATAAAT 
CTATCTTCAAAGACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACC TGGGGAGTGAGAGGACAGGATAGT GCAT GT TC TT T GT C TCT GAAT T T T TAGT TATAT GT GC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCT TCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAAT T TTAGCATAAACAGAGCAGT 
C GGGGAC ACC GAT T T T AT AAATAAAC T GAGC AC CTTCTTTT T AAAC AAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGMANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTS FELNSENVTMKWSVLYNVT INNTYSCMIENDIAKATGD IKVTE SE I KRRSHLQLLNSKASL 
CVSSFFAISWALLPLSPYLMLK 



Important features: 
Signal peptide: 

amino acids 1-28 



s»| Transmembrane domain: 

%| amino acids 258-281 

111 

m 

N-glycosylation sites. 

% % amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 

vk 220-224 

q 

1=9 N-ntyristoylation sites. 

y amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGTCAGAATCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
AT T GGT C T T CAT T C AAT GAT GAGAC C T G C C T CAT GAT GAT AAACAT G T T T GAC AAGAC CAAG T CA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
C T GACC AAGCAT GAGAG AGAT C T G T C TAT GGGACCAGT GGCT TGGAT T C T GCC ACAC CCAT AAAT 

CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T TGGGGCCAAAAGT C CAG TGAAAT T G TAAGCT T CAATAAAAGGATGAAAC TC TGA 



ID=1 0063728 



Page 215 



FIGURE 62 

MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYI SMKELKQALVNC3SIWSS FNDETCLMMINMFDKTKSGRI DVYGFSALWKFIQQWKNLFQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLS FEDFVTMTASRML 

Important features of the protein: 
I Signal peptide: 

I amino acids 1-19 

I N-glycosylation site. 

I amino acids 147-150 

I 

s 
I 

Casein kinase II phosphorylation sites. 

I amino acids 135-138, 150-153, 202-205, 271-274 

• N-iriyristoylation sites. 

I amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 

j 66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 

CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 

TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 

AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 

TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 

GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 

GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 

AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 

TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 

TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 

CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 

TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 

ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 

CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 

GCAGCT GGGGGAATAT TCAT C TATACCCAGG TATCAATAACC CAT TTCCTCCT GGAG TT C T CCGC 

CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 

GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 

TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 

TGC TCAT CT C CAATAAAATAAAAGCAC TTATGAAAA*\AAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 6a. 

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6* 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACT ATGG GGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAASAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 



APP ID-10063728 



Page 219 of 322 



FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 

ACGGACCGAGGGT T CGAGGGAGGGACACGGACCAGGAACC TGAGCTAGGT CAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCAGTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AG AC TCTGCCCCT G AGG T C AAG AG AG GAT G G GG C TAT T C AC T T T TAT AT AT T T AT AT AAAAT TAG 
TAGT GAGAT GTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 

MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGIiALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 



Important features : 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-4 0 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 8 6-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites . 

amino acids 7-13, 26-32 
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FIGURE 6q 

GCCAGGAATAACTAGAGAGGAACA ATG GGGTTATTCAGAGGTTTT GTTTTCCTCTTAGTT CT GTGCCTGCTGCACCAG 

TCAAATACTTCCTTCATTAAGCTGAATAATAATGGCTTTGAAGATATTGTCATTGTTATAGATCCTAGTGTGCCAGAA 
GAT GAAAAAATAATT GAACAAATAGAGGATAT GGT GACTACAGCTT CTACGT ACCT GT TT GAAGC CACAGAAAAAAGA 
TTTTTTTTCAAAAATGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCTCAGTACAAAAGGCCAAAACATGAA 
AAC CATAAACAT GCT GAT GT TATAGTT GCACCACCTACACT C C CAGGT AGAGATGAAC CAT ACAC CAAGCAGTTCACA 
GAAT GT GGAGAGAAAGGCGAATACATT CACTT CACCCCTGAC CTT CTACT T GGAAAAAAACAAAAT GAATAT G GACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
TACC GTG C TAAGT CAAAAAAAAT C GAAGCAACAAG GTGTT CC GCAGGTAT CT CTG GTAGAAATAGAGT TT ATAAGT GT 
CAAGGAG GCAGC T GT CT TAGTAGAGCAT GCAGAATT GATT CT ACAAC AAAAC T GTAT G GAAAAGAT T GT CAATT C TT T 
CCT GATAAAGTAC AAAC AGAAAAAGCAT CC ATAAT GTTTATG CAAAGTAT T GATT CT GTTGTTGAATTTTGTAACGAA 
AAAACCCATAAT CAAGAAGCTCC AAGCCTACAAAACAT AAAGT GCAAT TT TAGAAGTACAT GG GAG GT GATT AGCAAT 
TC T GAGGAT TTTAAAAACACCATACCCAT GGT GACACCACCT CCT CCAC CT GTCTTCT CATT G CT GAAGAT CAGT CAA 
AGAAT T GT GT GCTTAGTT CTTGAT AAGT CT GGAAGCAT GGGGGGTAAGGAC CGC CTAAAT C GAAT GAAT CAAGCAGCA 
AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA 
AAT AAGCTAAT C CAAATAAAAAGCAGT GAT GAAAGAAACACACTCAT GGCAGGATTAC CTACATAT C C TCT GGGAGGA 
ACT T C C AT CT GCT CT GGAAT TAAAT AT GCATTT CAGGT GATT G GAGAGCTACATT C CC AACT CGATG GAT CCGAAGTA 
CT GCT GCT GACT GAT GG GGAGGAT AACACT GCAAGTT CTT GTATT GAT GAAGT GAAACAAAGT G GGG CCATT GT T CAT 
TTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTT 
T CAGATGAAGCT CAGAACAATGGCCT CAT T GAT GCTTT TGGGGCT CTTACAT CAGGAAAT ACT GAT CT CT CC CAGAAG 
TCCCTTCAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACTGTCATAATTGATAGTACA 
GT G GGAAAGGACAC GTT CTT T CT CATCACATGGAACAGT CTGC CT CC CAGTATTT CT CT CT GGGATCCCAGT G GAACA 
ATAATGGAAAATTTCACAGTGGATGCAACTTCCAAAATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACT 
T GGGCATACAAT CTT CAAGCCAAAGCGAACCCAGAAACAT TAACTATTACAGTAACTT CT CGAGCAGCAAAT T CT TCT 
GT G C CTCC AATCACAGT GAATGCTAAAAT GAAT AAGGACGTAAACAGTTT CC CCAGCC CAAT GATTGT TT ACGCAGAA 
ATT CT ACAAGGATAT GTAC C TGTT CTT GGAGCCAATGT GACT GCTTT CAT TGAAT CACAGAAT GGAC ATACAGAAGTT 
TT G GAACTTT TGGATAATG GT GCAGGCGCT GAT T CTTT CAAGAAT GAT G GAGT CTACT C CAG GTATT T TACAGCATAT 
ACAGAAAAT G GCAGATATAG CTTAAAAGT T CGGGC TCAT GGAGGAGCAAACACTG CCAG GCTAAAAT T ACGGC CT CCA 
CT GAAT AGAG CC GCGTACAT ACCAGGCT GGGTAGT GAAC GGG GAAAT T GAAGCAAAC C C GCCAAGAC CT GAAAT T GAT 
GAGGAT AC T C AGACCAC CTT GGAGGATTT CAGC C GAACAGCAT C C GGAGGTGCATTT GT GGTAT CACAAGTCC CAAGC 
CTT C CCTT GC CT GAC CAATACCC AC CAAGT CAAAT CACAGACCT T GAT GCCACAGTT CAT GAG GAT AAGATTAT T CT T 
AC AT GGACAGCAC CAGGAGATAAT TTT GAT GTT GGAAAAGTT CAACGT TAT ATCATAAGAAT AAGT GCAAGTATT CT T 
GAT CTAAGAGACAGT TTT GATGATGCT CTTCAAGTAAATACT ACT GAT CT GT CACCAAAG GAGG CCAAC T C CAAGGAA 
AGCTT T GCATTTAAACCAGAAAATATCT CAGAAGAAAAT GCAACC CACATATT TATT GC.CAT TAAAAGTATAGAT AAA 
AGCAATT T GACAT CAAAAGTAT CCAACATT GCACAAGTAACT TT GTTTAT CC CT CAAGCAAAT CCT GAT GACAT T GAT 
CCT ACACCTACT C CTACT CCT ACTCCTACT CCT GATAAAAGT CATAAT T CTG GAGTT AAT ATTT CTAC GCT GGT ATT G 
T CTGT GATT GGGT C T GTT GTAATT GTTAACTTTATTT TAAGTAC CAC CATTTGAAC CTTAAC GAAGAAAAAAAT CTT C 
AAGTAGAC CTAGAAGAGAGT TTTAAAAAACAAAACAAT GTAAGTAAAG GATATTT CT GAATCTTAAAATT CAT CC CAT 
GT GT GAT CATAAACT CAT AAAAATAATTT TAAGATGT C GGAAAAGGATACTTT GATTAAATAAAAACACT CAT GGATA 
TGTAAAAACT GT CAAGATTAAAAT TTAATAGTTT CATT TATT TGT TAT T T TATTT GTAAGAAATAGT GAT GAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAAT TAAAATC AT CTAT CT GAGTAGT CAAAATACAAGTAAAGGAGAGCAAATAAACAACAT TT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKOTSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKAS IMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVI GELHSQLDGSEVLLLT DGE DNTAS SCI DE VKQSGAI VHF I ALGRAADEAVI EMSK I TGG 
SHFYVS DE AQNNGL I DAFGALT SGNTDLS QKS LQLESKGLTLNSNAWMNDTVI I DS TVGKDT FFL 
I TWNSLPPS I S LWDPSGT IMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTSR 
AANSSVPPITVNAKmKDVNSFPSPMIVYAEILQGWPVLGANVTAFIESQNGHTEVLELLDNGA 
GADS FKNDGVYSR Y FTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAY I PGW WNGE I EANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRY 1 1 RI SAS I LDLRDS FDDALQVNT TDLS PKEANSKES FAFKPENI SEENATH I FI AI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VMFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CTCCTTAGGT GGAAACCCTGGGAGTAGAGTACT GACAGCAAAGACCGGGAAAGACCATACGTCCCCGGGCAGGGGT GA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAAT GAT GTAGCCAC CT CCT AACCTT CCC TTCTT GAACC CC CAGTTATGC CAGGATT TACT AGAGAGT GT CA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGT AGAGAAGCT GCT CT GT GT GGTGGT TAACT CCAAGAGG CAGAACT C GTT CTAGAAGGAAAT GGAT GCAAG CAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
CGCAACTACGTGAGCAGCCTGAAGCGGCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGG 
U AAT GGGCAGTAC CAAGCCAGC GATGCT GCTGG CCT GGGT CT GGACAGGAGCC CCCCAGAGAAAAC C C AGGCC GACCT C 

« * CT GGCCT T CCT GCACT C GCAGGT GGACAAGGCAGAGGT GAAT GCT GGCGT CAAGCT G GCCACAGAGT ATGCAGCAGT G 

j \ CCT TT CGAT AGCTTTACT CTACAGAAGGTGTAC CAGCT GGAGACT GGCCT TAC CCGC CAC C C CGAGGAGAAGC CT GT G 

p AG GAAG GACAAGCGGGAT GAGT T GGT GGAAGCCAT T GAAT CAGCCTT GGAGACCCT GAACAAT CC T G CAGAGAACAGC 

7f\ CCCAATCACCGTCCTTACACGGCCTCTGATTTCATAGAAGGGATCTACCGAACAGAAAGGGACAAAGGGACATTGTAT 
'*! \ GAGCTCACCTTCAAAGGGGACCACAAACACGAATTCAAACGGCTCATCTTATTTCGAC CAT T CAGCC C CAT CATGAAA 

l s || GT GAAAAAT GAAAAGCT CAACAT GGCCAACAC G CTTAT CAAT GTTATCGT GCCTCTAGCAAAAAGGGT GGACAAGTTC 

H CGGCAGTTCATGCAGAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGG 
'*'} AAAGAAGAAAT AAATGAAGT CAAAGGAATACTT GAAAACACTT CCAAAGCTGCCAACT TCAGGAACTTTAC CTT CAT C 

f|j CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TT T TT CT GT GAT GT GGACAT CTACTTCACAT CTGAATT CCT CAATAC GT GTAGGCT GAATACACAGC CAGGGAAGAAG 
GTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA 
CAGCAGCTGGTCATAAAGAAGGAAACTGGATTTTGGAGAGACTTTGGATTTGGGATGACGTGTCAGTATCGGTCAGAC 
TT CAT CAATATAGGT GGGTTTGATCT G GACATCAAAGGCT GGGGC GGAGAG GAT G T GCAC CT TTAT C GCAAGT AT CT C 
CACAGCAACCTCATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCAT GAGAAGCGCT GCAT GGACGAG 
C T GAC C CCC GAGCAGTACAAGAT GT GCATGCAGT C CAAG GC CATGAACGAGGCAT C CCAC GGCCAGCT GGGCATGCT G 
GTGTTCAGGCACGAGATAGAGGCTCACCTT CGCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATG&ACT CCCAGA 
GAAGGAT TGT GGGAGACACT TT TT C TTT CCT TTT G CAATTACT GAAAGT GGCT GCAAC AGAGAAAAGACTT CCATAAA 
111 | GGACGACAAAAGAATTGGACTGATGGGTCAGAGATGAGAAAGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGA 
f*\ AATCAAAAT CTC C G CTTT GCCT GCAAAAGTAACCCAGTTG CAC CCTGT GAAGT GT CT GACAAAGGCAGAAT GCTTGT G 

5f ? - AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
ni AATATTCAT GATTTAAGAGCAGTTTTGTAAAAAAT TCATTAGC AT GAAAGGCAAGCATATTT CT C CT CATAT GAAT GA 

GCCTATCAGCAGGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTATTATGATACT 
AGTGAGTACATTAAGTAAAATAAAATGGACCAGAAAAGAAAAGAAACCATAAATATCGTGTCATATTTTCCCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTACCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTAC 
ATTT T T ATAT TTT T TAAGAAGATACTT TGAGAT GCATTATGAGAACTT T CAGT T CAAAGCAT CAAATT GATGCCATAT 
CCAAGGACAT GC CAAAT GCT GATT CTGTCAGGCACT GAAT GT CAGGCATT GAGACATAGGGAAGGAAT GGTTT GTACT 
AATACAGAC GTACAGATACTTT CT CTGAAGAGTAT TTT C GAAGAGGAGCAAC T GAACACT GGAG GAAAAGAAAAT GAC 
ACTTT CT G C T TTACAGAAAAGGAAACT CATTCAGACT GGT GATAT CGT GATGT AC CTAAAAGT CAGAAAC CACATTTT 
CT C CT CAGAAGTAGGGACC G CTTT CTTACCTGTTTAAATAAAC CAAAGT ATACCGTGT GAAC CAAACAAT CT CT TTT C 
AAAACAGGGT GCT C CTCC T GGCTTCTGGCTT C CATAAGAAGAAAT GGAGAAAAATAT ATAT ATAT ATAT ATATATT GT 
GAAAGAT CAAT C CATCT G CCAGAAT CTAGT GGGAT GGAAGTTTTT GCTACAT GTTAT C CACC C CAGGCCAGGT GGAAG 
TAACT GAATTAT TT T TTAAATTAAGCAGT T CTACT CAAT CACCAAGAT GCTT C T GAAAAT TGCATTTTATTAC CATT T 
CAAACTATT TTT T AAAAATAAATACAGTTAACATAGAGT GGT TT CTT CAT TCATGTGAAAATTATT AGC CAG C ACCAG 
AT GC ATGAGCTAAT TAT CT CTTT GAGT CCT TG C TTCT GTTTGCT CACAGTAAACTCAT T GTTTAAAAG C TTCAAGAAC 
ATTCAAG CT GTT GGT GT GTTAAAAAATGCATT GT ATT GATTT GTACT GGTAGTTT AT GAAATT T AATTAAAAC AC AGG 
CCAT GAAT GGAAGGT GGTAT T G CACAGCT AATAAAATAT GAT TT GTGGATAT GAA 
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FIGURE 72 

MYOyrVRRGLLAWI SRVW^ 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 

AEWAGVKMTEYAAVPFDSFTLQKYYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 

ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNiytAN 

TLIWIVPl^RVDKFRQmQNFREMCIEQDGRVHLTVWFGKEEINEWGILENTSKAANFRNF 

TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVMYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 

NPGIIY'GHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 

KYLHSNLIVTOTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVF 

RKQKQKT S S KKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 

GAGAC TGCAGAGGGAGAT AAAGAGAGAGGG C AAAGAGGCAGCA&GAG AT T T GT C C TGGGGATC CA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATG GAT AT T CAGACAGACAG TGTGACAT T T GACCC TGAT T TGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
TGAAGCCACATTTGCAGAGCTCCACATTGTACATTATGACTCTGATTCCTATGACAGCTTGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATAT AGC T TATGAACACAT TC TGAGT CAC T T GCAT GAAGTCAGGCAT AAAGATCAGAAGACC TC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
ATTTCAATGGAACAGCTGGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTC TTCAC CTCAGCACA&GCCACGAC TGAGGCATAAAT T CC T T C TCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGT AAGCAGAT GTCC TCCT TCCCCT GGACATCTCT TAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCTTAGGATAAAGAGTTGCTGTTGAAGTTGTATATTTTTGATCAATATATTTGGAAATTAAAG 
TTTCTGACTTT 
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FIGURE 74. 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFT SAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 



V Transmembrane domain: 

no 

V:: amino acids 291-310 

m 



N-glycosylation site. 

amino acids 213-216 

Eukaryo tic- type carbonic anhydrases proteins 

: i\ amino acids 197-245, 104-140, 22-69 

u 
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FIGURE 75 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 
ATGGACTCCACCAGAGGGTCTTCAAGGAGTTAAAGTTACTTACACTGTGCAGTATTTCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 
GAT T T TGAT T TAT GGAAAT GAAT T T GACAAAAGAT TCT T TGTGCC TGC T GAAAAAATCGT GAT TA 
ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 
AAAACACGGAAGGTACTTCTCT CACCCAGCAAGAGTCCC TCAGCAGAACAATACCCCCGGATAAA 
ACAGTCATTGAATATGAATATGATGTCAGAACCACTGACATTTGTGCGGGGGCTGAAGAGCAGGA 
GCT CAG T T TGCAGGAGGAGG T G T CC ACACAAGGAAC AT TAT T GGAGTCGCAGGC AGC GT T GGGAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CC AGACAGGCC AC CAGGAGAAAAT GAAAC C T AT CT CATGCAAT T C AT GGAGGAATGGGGG T TATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 



MSYNG-LHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKI IFWYVLPIS ITVFLFSVMGYSIYRYIHVGKEKHPANLILI YGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 



Important features: 



Signal peptide: 



amino acids 1-28 



fff 



Transmembrane domain: 



amino acids 140-163 



LI! 



N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
GTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCAC CCTCAGTCCCAC TG CAGT T C T CATC C TCGGCC CAAAAGT CAT CAAAGAAAAGC TGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
Q TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
O CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
p ; TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
f I CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
fi'f CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
Of T CCT GCCATCAAGGG TGACACCAT T CAGCT CTACCT GGGGGCCAAGT TGT T GGAC TCACAGGGAA 

* m AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
<:=?l TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
j!» ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
ft TGAT CAAT GAAAAGGCT GCAGAT AAGCT GGGAT CTACCCAGAT CGT GAAGAT CC TAAC TCAGGAC 

Q ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
fll TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TTTACACCAAAGGTGACCAACTTATACTCAACTTGAATAACATCAGCTCTGATCGGATCCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 

GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGG I PVLGSLVNTVLKHI I WLKVI TANI LQLQVKPSANDQELLVKI PLDMVAGFNTPLVKT I VE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 
VKNQLCPVIEAS FNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDWKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNNI S S DR I QLMNS G I GW FQPDVLKNI I TE I IHS I LLPNQNGKLRSGVPVS LVKALG 
FE AAES S LTKDALVL TPASLWKPSS PVSQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylati on si tes . 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAGT CAGCC TGGCAGAGAGACT C T GAAATGAGGGAT TAGAGGT G T TCAAGGAGCAAGAGC 
T T CAGCC T GAAGACAAGGGAGCAGT CCCT GAAGACGC TTCTAC TGAGAGG T CTGCCATGGCC TC T 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CAT GAAATT TGAGAT TGGAGAGGC T CT T TAC T T GGGCAT TATT TC T TCC CT GT TCTC CC TGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GAC TAAT T T GTGCATGAAC TGAAATAAAACCATCC TAC GGTATCCAGGGAACAGAAAGCAGGAT G 
CAGGAT GGGAGGACAGGAAGGCAGC CT GGGACATT TAAAAAAATA 
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FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YS TLLG L PAD I QAAQAMMVT S S AI S S LAC 1 1 S WGMRC T VFCQE S RAKDRVAVAGGVFF I LGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGI ISSLFSLIAGI ILCFSCSSQRNRSNYY 
DAYQAQPLATRS SPRPGQPPKVKSE FNS YSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTGCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACT GTCAAGAT CGTC CTGAAGGAGAAACATAAGAAAGC CT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GTGAAG TACC TGGCCCAAGGCGACACAGC CAGAAT C TTCCAC T T GACTCAGATCAAGAAAGTCAG 
GAAGCAAGAC TT CCAGAAAGAGGCACAGCAC T T CCGAC T GCT CGCT GGCC CC CACGAAGG TCAC T 
GGAACGT C T T C C T AGCCCAGACC CT GGAGC T GAAGGT CAC GGC CAG T C C AGACAAAGTGACCAAG 
ACAT AAC AAAGAC C TAAC AG T T GCAGAT AT GAGC T GT AT AAT T G T T G T TAT T ATATAT TAAT AAA 
TAAGAAGT T GCAT TAC CC TCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
I S S TRCPKAPGRVLVHTS VS PS PDNLRRFALEHEAS DLVE I YLWKLVKDEE TEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

% Signal peptide: 

%l amino acids 1-25 



m 
CI 

?i\ 

f*\ 

ry 
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FIGURE 8^ 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
AC AGGT GT C AGC ACT GGG CT C AGT TCCT CT C AT TTCCAT C ACGG GATAT GTT GAT AG AGACAT CC AGC TAC 



fl TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 

fffl ACAGAC TCC AGG AC AAAC AGAG AC AT GC AT GGCCTGTT TGAT GT GGAG AT CT CTCTGACCGT CC AAGAGAA 

Llf CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 

lU TT TT TT GGC ATT GT T GGACT GAAGAT TTT C TT CTCC AAAT TCCAGT GGAAAATC CAGGCGGAAC TGGACT G 

fri 

GAGAAGAAAGCACGGACAGGC AGAAT TG AGAGAC GCCC GG AAAC ACGC AGTGGAGGT GAC TC TGGATC CAG 

%» AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 

T% CCTC AC TCTGAG AAGAGATT T AC AAGGAAG AGT GTGGT G GCT T C TC AGAGTT T CCAAGCAGGGAAACATT A 

'3? 5 

M CTGGGAGGT GGACGGAGG AC AC AATAAAAGGT GGCGCG T GGG AGT G TGCC GGGATGAT GTGG AC AGG AGG A 

HI AGGAGT ACGT GACT T T GT CT CCCGAT CATGGGT ACT GG GT CC TC AG ACT GAATG GAGAAC AT TT GT AT TT C 

Q ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 

fff TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 



AAGGCT TAT T GAGGCCCT AC AT TG AGT ATC CGT C CT AT AATG AG CAAAAT GGAACT CCCATAGT C AT CTGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACCGT C AGGAATT C CC ATC TCACAGGCTGTGGT GT AG AT T AAGT AGACAAGGAAT GT GAAT AATGCT TAG 
ATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAA 
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FIGURE Sd 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFS 

LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPL I S I TG YVDRD I QLLCQS SGW FPRPTAKWKGPQGQDLS T DS RTNRDMHGL FDVE I S L 
TVQENAGS I SCSMRHAHL S REVES RVQ I GDT FFE P I SWHLATKVLG I LCCGL FFGI VGLKI FFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 85 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
T AC T T CT TT CGTAT GGAGAAAGGAAGT ATAAAATGGAAT TATAAACAT CACCGGC TC TC T G TGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
!;i AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
% GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
y GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
\! ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 

■Hi 

GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
?M GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
01 CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
0 GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
^ CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 

St TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

I II 

AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAACT GC AGAGACTCAC CC TGAT T GAGGGATCACAGC CCC T CCAGGCAAGGGAGAAGT CA 
GAGGCT GATT CT TGTAGAAT TAACAGCCCTCAACGTGATGAGCTAT GATAACACTATGAATTATG 
T GCAGAGTGAAAAGCACACAGG CT T TAGAGT CAAAGTAT C TCAAAC C TGAAT CC ACAC T GT GCCC 
TCCC T T T TAT T T T T T TAAC TAAAAGACAGACAAAT T CCTA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLWSLQSKATSGVTQGWGGAGATALVFLSFCVIFWVRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPAS7VRSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGAC TCAGACAAGATTACAATGAACCAACTCAGCT TCCTGCTGT TT C 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
^ CAAC TACAACACCT TTGGATCTGCAGAGGCGGCCACGAGCGATGAC TACAAGAACCCTGGCTACT 

C! 

ACGACATCCAGGCC^GGACCTGGGCATCTGGCACGTC^ 

m AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
UJ GT T T GGC AT CTACCAGAAATAT C CAG T GAAATATGGAGAAGGAAAGT GT T GGAC T GACAACGGC C 

H CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
j 3 j{ GGCCAGCGGGAAT T CAC T GC GGGAT T TG T T CAG T T CAGGGTAT T TAATAACGAGAGAGCAGCCAA 

CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
Q GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
yl GGTVACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
O ^ZS^^^^^^^^^^^AGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 

CAAC T T ACCCAGT AGCT AGAAT G T T AAT GGC AGAAGAGAAAACAAT AAAT CATAT TGAC TC AAGA 
JS j j AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKAJDYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVI PWYDFGDAQKTAS YYS PYGQRE FTAG FVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82-, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 

CTAGAT T T GT CGGC TTGCGGGGAGACT T CAGGAGTC GC TGTC T C T GAAC T T CCAGCC TCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCGGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACC TAAGGACGAAG CC TACAGGGGCAAGGGC CGC T TCGT ATT CCT GGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T TAAGAC T TATATACAGT TT TAGGGGACAAT TAAAAAAAAAAAA. 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLWVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VEIJVGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Qi 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATGCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
C CAACACAATGCCACAGGCCAGGGAAACAT C T CCAT CAGC CT C GT GCC CCCCAGTAAAGCT GTAG 
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACAT CGCC T TC TACAGCACGGAC TAT CGGC TGG TCCAGAAGGT GT GCC CAGAT TACAAC TAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGG T CCCAAGTGCTGG T CC CAAC C T GAAGCT GTGGAGT GACTAGAT CACAGGAGCACTGG 

AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CC TTGGTTCT TGCCATCC TGAGGAAAGAT AGCAACAGGGAGGGGGAGAT T TCAT CAGT G T GGACA 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACAC CCAT CC TTAAGCTAAGACAGGACGATTGTGGTCC TC CCACAC TAAGGC CACAGC C CAT C 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AK ICS RDHAQ S S AT W S C S QP FKWC VY I AFYS T D YRL VQKVC P D YN YH SDTPYYPSG 

Important features of the protein: 
Signal peptide: 

\Z amino acids 1-14 

CI 

g\ N-glycosylation sites. 

UJ amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 

CGGTGGCCATGA.CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
AT G T TC CGAT T TGCATAT TATAAACTCT TAAAAAAAGCCAGT GAAGGT T T GAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
G GAGAT T C T CC T CAAT TC T TCC T TTAT TGAGC T T TCAT GACGC T GGT CAT TAT C T TGC T GCAT GT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCT GAAAC TC T GCC T GCT CTGCCAAGAGAAGAAC T T TCT T CT TT ACAAC CAGCGC TC CAGAT 
AACCT CAGGGAACC AGC AC T TCCCAAAC CGCAGAC TACATC TT TAGAGG AAGCACAAC TGT GC CT 
T T T T C T G AAAAT C CCTTTTTCTGGT GGAAT T GAG AAAG AAAT AAAAC TAT G CAGAT A 
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FIGURE qa 

MTAAVFFGCAFIAFGPALALYVFTIAIEPLRI IFLIAGAFFWLVSLLISSLVWFMARVI IDNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFI S S Y YG INLAS AFI I LVLMG T WAFLAAGGS CRS LKLCLLCQDKN FLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site* 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65' 
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FIGURE 

AATTTTTCACCAGAGTAAAGTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACAC T ACCAAAC C AAC AGCAGT CAAAT CAGGTC T TTCCTTCTT T AAGT CT GATAC CAT T AAC A 
CAGAT GC T CACAC TGGGGC CAGAT CTGCAT CT GT TAAAT CCTGCTGCAGGAATGACAC C TGG TAC 
C CAGAC C C ACCCATTGACC C T G GGAGGGT TGAATGTACAACAGCAAC TG CACC CACATGTGT TAC 
CAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCAT CC AAAGGAGCACACAT GCCATCGAGGAAGCCACC ACAGAAT C AGCAAAT GGAAT T CA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT TGAT TATATTATGGAATAGATTGAGACACATTGGATAGTCT TAGAAGAAATTAAT TCT TAATT 
TAC C T GAAAAT AT T C T T G AAAT T T C AGAAAAT AT G T T C TAT G T AG AGAAT C C C AAC T T T T AAAAA 
CAATAAT TCAAT GGATAAATC TGT CTT TGAAATATAACATTAT GC T GCC T GGATGATATGCATAT 
T AAAAC AT AT T T GGAAAAC TGGAAAAAAAAAAAA?^UyU\AAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLP I FVTQLGAQGT I LSSEE 
LPQIFTSLIIHSLFPGGILPTS QAGANPDVQDG S L PAGGAGVNPAT QGT PAGRL P T P S G 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 

GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCC'CCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCT.TCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGeATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
CCAGGCCTCAGGCCACCAGGCCAC TGCCACCGTGGAAGTC TCCATCAT AGAGAGCACC TGGGTGT 
CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
TGCAGAGGGAAAC CTCTACGT GAC CAGAGAGC T GGACAGAGAAGCC CAGGCT GAGTACCTGC TCC 
AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
ATTCCCACGTTGTG'TATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
TCGAAG TCGCAGT CACAGATAT CAAT GAT CACGC C CC T GAGT TCAT CAC T TCCCAGAT TGGGC C T 
ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
GCCT GGAT T GGGAGCCAGAC TCTGGGCATGT TAGAC TCAGACT C T GCAAGAACC TCAGT TATGAG 
GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATAGAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T TT CACC CAC T GGACCAT GTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGT GC CCC T 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
SGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVSIIESTWVSLEPIHLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPT S GSVTLGVLPLRAGQNI LLLVLAMDLAGAEGGFS S TCEVEVAVTDINDHAPE FI TSQ I G 
f 'f PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 

Q 

f H EAAPSHEWWVQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 

Q) IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 

III QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 

'2\ REHIIPVWSHNAQMWQLLVRVIVCRCWEGQCMRKVGRMKGMPTKLSAVGILVG 

ji{ L I FTHWTMSRKKDPDQPADSVPLKATV 

Q Signal peptide: 

L|'j amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE qq 

GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGT CCAAGAT TC TTCCCAGGAACACAAAC G TAGGAGACC CACGCT C C T GGAAGCACCAGC C T T TA 

TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATC TTGCTT TCCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAATG 
T TC T CC T TAT GT T TGGT C TAC TAT T GCAT T TAGAAGC TGCAACAAAT T CCAAT GAGAC TAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAACTCTGAGT TCCATACAACC TCCAGTGGGAT CAGCACAGCCACCAACTC TGAG 
T T C AG C AC AG C G T C C AG T GGG AT C AG CAT AG C C AC C AAC T C T GAG T C C AG C ACAAC C T C C AG T G G 
GGC CAGCACAGCCACCAAC T C T GAGT CCAGCACACCC TCCAGT GGGGCCAGCACAGT CACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGCCACCAAC TC TGAC T CCAGCACAGT GTCCAGTGGGGCCAGCAC TGCCACCAAC TC TGAG 
T C CAGCAC GACCTCCAGTGGGGCCAGCACAGCCAC CAAC TC T GAG T CCAGCACGACC TCCAGT GG 
GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
CAACT CT GAG T CCAGCACAGTGTC CAGT GGGGCCAGCAC TGCCACCAACT CT GAGT CCAGCACAA 
C C T C C AG T GG G G T C AG C AC AGC C AC CAAC T C T GAG T C CAGCAC AAC C T C C AG T GG GG C TAG C AC A 
GCCACCAACT CTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAAC TC TGAGTCTAG 
CACAG T GT C CAGTGGGATCAGCACAGT CACCAAT T CTGAGT C CAGCACAACC TCCAGTGGGGCCA 
ACACAGCCAC CAAC TC TGGGTCCAGTG T GACCTC TGCAGGC T C TGGAACAGCAGCTC TGAC TGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
T CT TC TT CT G TG TGAGAAACAGC C T GT CCC T GAGAAACACC TT TAACACAGC TGTC TACCACCC T 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
G T GGAGT CC T AAC T GGT TC TGGAGGAGAC CAG TAT CAT CGATAGCCAT GGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGT^AGCAAGTGCCGCATTCTTCAGGAAGGTIAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AAT C T T G AAGAAGG T AT T C C T C AC CTTTCTTGCCTT TAC C AGAC AC T GG AAAG AG AAT AC TAT AT 
T GC TCAT TTAGC TAAGAAATAAATACATCT CATC TAACACACACGACAAAGAGAAGC TGT GC T T G 
C CCCGGGG T GGGTATC TAGC TCT GAGATGAAC T CAGT TATAGGAGAAAACC T CCAT GC TGGAC T C 
CATCTGGCATTCAAAATCTCCACAGTAAAATCCAAAGACCT^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 

MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
S VT SNGVS I VTNSE FHT TS SGI STATNSE FS TAS SG I S I ATNSE S S TT S S GAS TATNSES S T PS S 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
S T T S S GANT ATNS G S S VT SAG S GTAALT GMHT T S HS AS T AVS E AKPGGS LVP WE I FL I TL VS WA 
AVGLFAGL FFCVRNSLSLRNT FNTAVYH PHGLNHGLGPGPGGNHGAPHRPRWS PNWFWRRPVS S I 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AAC GAG G CAG T T T C TAG C T AC T G C AC G TACT T C ATAAAGCAGGAC T C T AAAAGC T T T GGAAT CAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT TAT AAC T GGGGAT GCAT TTGTTCCT GGAGAAAGAAGT GTC AT TAT CAT GAACCAT C GGACAA 

GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CTTCATTCATAGGAAATGGAAGGATGACAAGAGCCATTTCGAAGACATGATTGATTACTTTTGTG 
AT AT TCACGAAC C AC T T CAACT CC T CATAT T CCCAGAAGGGACT GAT C T C ACAGAAAACAGCAAG 
TCTCGAAGTAATGCAT TTGC T GAAAAAAATGGACT TCAGAAATATGAATATGTTTTACATCCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
AT AT CACT G T GGCG T ATCC TCACAACAT TCC T CAATCAGAGAAGCACCT C CT CCAAGGAGACT T T 
CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCT T CAAC T C TGGTGCCACAAACGG TGGGAAGAGAAAGAAGAGAGGC TGCGTTCCTTC TATCAAG 
GGGAGAAGAATTTTTATTTTACCGGACAGAGTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATAT T T GGT GGAC TGGAGATCATAGAACT TGCATGT TACC GACT T T TACACAAACAGCCACAT 
TTAAAT TCAAAGAAAAATGAG TAAGAT TATAAGGT T TGCCATG TGAAAAC CTAGAGCATAT T T T G 
G AAAT G T T C T AAAC C T T T C T AAG C T C AGAT G CAT T T T T G CAT G AC TAT GTC GAATAT T T C T T AC T 
GCCAT CAT TAT T TGT TAAAGATAT T T T GCAC T TAAT T T TGTGGGAAAAAT AT T GC T ACAAT T T T T 
T T T AAT C T C T GAAT G TAAT T T C GAT AC TG TGTACATAGCAGGGAGT GAT CGGGGTGAAATAAC T T 
GGGCCAGAATAT TAT TAAACAATCATCAGGC T T T TAAA 
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FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPmFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVIITGDAFVPGERSVIIMNHRTRMDWM 
FLWNC LMR YS YLRLE K I C LKAS LKG VPG FGWAMQAAAY I F I HRKWKDDKS H FE DM I D Y FC D I HE P 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLWKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 1Q3 

CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCTCTACAGAGATGGGGAAGACTGGGAATC TAAGCAGATGCCACAGTATCGAGGGAGAAC TGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGAT T T GTC T TCAGACT C CAGAGCAAAT GCAGAT GGGTACAGCCT GTAT GAT GTGGAGAT CT CC 
ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
G GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
□ TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
ffi AAAT CCAAAGGGAAAAT CCAGGCGGAACT GGAC TGGAGAAGAAAGCACGGACAGGCAGAAT T GAG 

■ d AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
ft T T TC TGAT CT GAAAACT GTAACCCATAGAAAAGCTCCCCAGGAGGT GCC T CAC T CT GAGAAGAGA 

TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
fll CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ffi AT G T G AC TTTGTCTCC C AACAAT G G G TAT TGGGTCCT C AGAC T G AC AAC AG AAC AT T T G TAT T T C 

T ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
« 5 GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
T GACAT GTCAGT T T GAAGGC T T GT T GAGACC CTATATCCAGCATGCGAT G TAT GACGAGGAAAAG 
If! GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
Q CCCACAC CACAGACC CAGACACAGCCAAGGGAGAGT GC TCCCGACAGGT GGCCC CAGC T T CCTC T 

7i\ CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
M TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 
f|| CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGTGAT TC C TGCC TCACAGG T GAAGAT TAAAGAGACAACGAAT G TGAATCAT GCTTGCAGG T T 

TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
TGCTTATTC CAC AT T AAT T TACT T T T C T C TAT AC C AAAT CAC C CAT G GAAT AG T TAT TGAAC AC C 
T GC T TTG TGAGGC TCAAAGAATAAAGAG GAG GT AGGAT T T T TCAC T GAT TC TATAAGCC CAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAAC T ACAGGT CCATAT CC 
CT CAT TAACACAGACACAAAAATT CT AAATAAAAT T T TAACAAAT TAAACT AAACAATATAT T T A 
AAGATGAT AT ATAAC TAC TCAGTG T GGT TT GT CCCACAAAT GCAGAG T T GG T T TAATAT T TAAAT 
AT CAAC CAG T G T AAT T CAGCAC AT T AAT AAAG T AAAAAAGAAAAC C AT AAAAAAAAAAAAAAA 
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FIGURE lOA 

MAFVL I LVLS FYE LVS GQWQVT G P GKFVQALVGEDAVFS C S L FPE T S AEAME VRFFRNQFHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
I VQENAGS I LC S I HLAEQS HE VE S KVL I GE T FFQP S P WRLAS I LLGLLCGALCGWMGM 1 1 VF FK 

SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 

TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 

FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 



APP_ID=1 0063728 



Page 258 of 322 



FIGURE to* 



CCTTCACAGGACTCTTCATTGCTGGTTGGCAATGATGTATCGGCCAGATGTGGTGAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACT GT T C AT TATGTG AGAT AT AATCAAAAGAAG ACCT AC AATT AC T ATAGC AC AT T GTC AT TT AC AAC 
TGACAAACTATATGCTGAGTTTGGCAGAGAGGCTTCTAACAATTTTACAGAAATGAGCCAGAGACTTGAAT 
C AATGGTG AAAAAT GC AT TTT ATAAATCT C CATTAAGGGAAG AATT T G TC AAGT CT CAGGTT AT CAAGTT C 

AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
t , AT CCT C ACT C AGTT AAAATT AAAAAAATCAAC AAGACAGAAACAGACAGC TAT C TAAAC C AT TGCTGC GGA 

ACACGAAGAAGTAAAACTCTAGGTCAGAGTCTCAGGATCGTTGGTGGGACAGAAGTAGAAGAGGGTGAATG 
Q GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
31 TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
ill ATAAAACCTTCGAAAATGAAACGGGGTCTCCGGAGAATAATTGTCCATGAAAAATACAAACACCCATCACA 
If* TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
yj TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 
' M GATGGT TACAGTCAAAAT CATCTT CGACAAGC AC AGGTGACTCT CAT AGACGCT AC AACT TGCAATGAAC C 

I-. TCAAGCTTACAATGACGCCATAACTCCTAGAATGTTATGTGCTGGCTCCTTAGAAGGAAAAACAGATGCAT 
f|j GCCAGG GT GACTCT GGAGGACCACTGGT T AGT TCAG AT GC T AGAGAT ATC TGGT ACCT T GCT GG AAT AGT G 

%% AGCTGGGGAGATGAATGTGCGAAACCCAACAAGCCTGGTGTTTATACTAGAGTTACGGCCTTGCGGGACTG 
S| GATTACTTCAAAAACTGGTATCTAAGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 
Q GGTGTGGAGGCCATTTTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCA 
f|| ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATCAACTCTGTCATCTGTGAGCAATAGTTGAAACTTTATGTACATAGAGAAATAGATAATACAATATTAC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
T C AAGGTGCAGAAC AAGGAGTGAAAGAAAAT AT AAGAAGAAAAAAATCC C CT AC AT T T TATT GGC ACAGAA 
AAGTATTAGGTGTTTTTCTTAGTGGAATATTAGAAATGATCATATTCATTATGAAAGGTCAAGCAAAGACA 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
ATAATAATTATACAAACTTCATGCAATGTACTTGTTCTAAGCAAATTAAAGCAAATATTTATTTAACATTG 
T T AC T GAGGATGTC AACAT AT AACAAT AAAAT AT AAAT C ACCCA 



APP_ID=1 0063728 



Page 259 of 322 

i 



FIGURE 106 



MMYRPDWRARKRVCWEPWI GLVI FI SLI VLAVC IGLTVHYVRYNQKKT YNYYS TLS FTTDKLY 
AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVI^JiMLLICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTAS FGVTIKPSKMKRGL 
RRI I VHEKYKHPSHDYDI SLAELSSPVPYTNAVHRVCLPDAS YEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 1Q7 

AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
AT AAT G T T C T CAC T T T AAC TG CAGAAACGAGGG TAGAGGAAGCAGT CAT T T T GAC T T AC T T T C C T 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
G TG T AGAAC TGGC T T G TGGCGT T TGGACATAT GAACAGGAAC T TAT GGT T CCAG TACAAT GGTCA 
GAT AT G G T CAC T T T GAAAGCCAGGAT GAC AAAT T AT GGAT T AC C T AGAT AT CGGTGGCT T AC T CA 
T GCT TGGAAT T T T T T T CAGAGAGAGT TTAAGT GCTGTGGAG TAG TAT AT TT CAC TGAC TGG T TGG 
AAAT GACAGAGATGGAC T GGCC CC CAGATTCC TGC TG TGT TAGAGAAT T C C CAGGATG TT CCAAA 
CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAAT GATGTC C T TGAAGAATGACAAC TC TCAGCACC T GT CAT G T CC C TCAG TAGAACT 
G TT GAAA.CCAAGC C T GT CAAGAAT C T T T GAACACACATCCATGGCAAACAGCT TTAATACACACT 
TTGAGATGGAGGAGT TATAAAA?VGAAATGTCACAGAAGAAAACCACAAACT TGTT TTATTGGACT 
TGTGAATTTTTGAGTACATACTATGTGTTTCAGAAATATGTAGAAATAAAAATGTTGCCATAAAA 
TAACACCTAAGCATATACTATTCTATGCTTTAAAATGAGGATGGAAAAGTTTCATGTCATAAGTC 
AC CAC C T G GAC AAT AAT T GAT G CC C T TAAAAT GC T GAAGAC AGAT G T CAT ACC C AC T G TG TAG C C 

TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
T CCAT GCAAACGAGT CACATATGGT GGGACT GGAGCCATAGTAAAGGT TGAT T TAC T TCTACCAA 
C TAG T AT AT AAAGTAC TAAT TAAAT GC TAACATAGGAAGT TAGAAAATAC TAATAAC T T T TAT TA 
CT CAGCGAT C TAT TC TTC TGAT GCTAAATAAAT TATATAT CAGAAAAC T T T CAATAT TGGTGACT 
AC C TAAATGT GAT T T TT GC TGGT TACTAAAATATTC T TAC CACT TAAAAGAGCAAGC TAACACAT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
TTCAGTTCTGATAATGTTAAGAATAACCATTATG7UVAAGGAAAATTTGTCCTGTATAGCATCATT 

ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGT TAT T T AAAT AC T T AAC CAC TAAT T T T G AAAA T TAC CAG TGT GAT AC AT AG G AA T CAT TAT T C 
AGAAT G TAG TCTGGTCTT TAGGAAGTAT TAATAAGAAAAT T TGCACATAACT TAGT T GAT T CAGA 

AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GCT TAT CTTTGCCTTCTC C AAAC AAG AAG C AATAG T C T C C AAG T CAATAT AAAT T C TAC AG AAAA 

TAGTGTT CT TT TTCTCCAGAAAAATGCT TGTGAGAATCATTAAAACATGTGACAAT TTAGAGAT T 
CTTTGTTT TAT T T CAC TGAT TAAT AT AC TGT GGCAAAT TACACAGAT TAT TAAAT T T TT T TACAA 
GAG TATAGTATAT T TAT T TGAAAT GGGAAAAG TGCAT T T TACT GTAT T T T GT GTAT T T TG T T TAT 
TTCTCAGAATATGGAAAGAAAAT TAAAATGTGTCAATAAATAT TT TCTAGAGAGTAA 
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FIGURE 108 



MAREDSVKCLRCLLYALNLLFWLM^ 

VMIAVCCFLIIVGMLGYCGTVKRNLLLIAWYFGSLLVIFCVELACGVWTYEQELMVPV 

lkarmtnyglpryrwlthawnffqrefkccgvwftdwlemtemdwppdsccvrefpgcskqahq 
edlsdlyqegcgkkmysflrgtkqlqvlrflgisigvtqil™iltitllwalyydrrepgtdqm 
mslkndnsqhlscpsvellkpslsrifehtsmansfnthfemeel 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE loo 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAAC ACTGACCACCAGTTGGATGAGTCTCAAGGTCCTCCTCTATGTGACAACCATGT GAAT GGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
japfe CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
Cf CAGCTGCTCAGATACCAGCGAGTGCACATGCGCTCCAGGAACTGTGCTAGGCCCTGACAGGCAGACATGCTTTG 
Q AT GAAAAT GAAT GT G AGC AAAAC AAC GGT G GCT GC AGT GAGAT CT GT GT GAACC T C AAAAAC TCCT AC C GCT GT 

3} GAGTGT GGGGT T GGCCGT GT GCT AAGAAGT GAT GGCAAGACTT GTGAAGAC GTTGAAGGAT GCC AC AAT AAC AA 

W TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
•** CT G AGGAT AAC C AC AC T T GCCAAGT CC CT GT GT T GT GC AAATC AAAT GC CAT T GAAGT GAACAT C CC C AG GGAG 

S - CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 

m 

CCTCTTCTCTCTCAAGACATGTGGTACAGTGGTCGATGTGGTGAATGACAAGATTGTGGCCAGCAACCTCGTGA 
l H CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACC T G C G AGT T T C CAC GC C T GT ACACC AT T T CT GAAG GAT AC GT T CC C AACC TT CGAAAC TC CC C AC T GGAAAT 
*■{ CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 
S{ GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
pi TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
CCGGGAT GGC T GT GT T TC A GAT G ACT C GGT AAAGC AGT AC AC AT CCC G GGAT C ACC T AGC AAAGC AC T T C C A GG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
C AGGT CAC AGC AC T GC T GAACAAT GT GGC CT GGGT GGGGT TT CAT CT T TCT AGGGT T GAAAACT AAAC T GT C C A 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
C AC AAAAT CAGAAGC T GG GT ATAATAT TT C AAGT T AC AAACC C T AGAAAAAT T AAAC AGT TACT GAAAT TAT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TT AT AGG GAAT T T GG AAGT GT AT C AAT AAAAC AGT AT AT AAT T T T 
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FIGURE no 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDWNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
Ll LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

a 

pi Important features of the protein: 

0| Signal peptide: 

y| amino acids 1-16 

i M N-glycosylation sites. 

rfs 

!s§s amino acids 89-93, 116-120, 259-263, 291-295, 299-303 



Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 



N-myristoylation sites. 

amino acids 22€ 
522-528, 531-537 



til amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 



Aspartic acid and asparagine hydroxy la tion site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 



Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE ill 

GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGCATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGC AGC TGT GTG ACG GAGAGC TGGACT GTCCCT TGGGGG AG GACG AGGAGC ACT GTGTC AAGAGC TT C 

CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 

AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 

GCTACAGCAGAGCTGTGGAGATTGGCCCAGACCAGGATCTGGATGTTGTTGAAATCACAGAAAACAGCCAG 

GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 

L|| TGGGAAGAGCCTGT^AGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 

TCAGCATCCAGTACGACAAACAGCACGTCTGTGGAGGGAGCAl'CCTGGACCCCCACTGGGTCCTCACGGCA 

GC CCAC TGCT T C AGGAAACAT ACCGATGTGTTC AAC TGGAAGGT GCGGGC AGGC TC AGACAAAC T GG GC AG 

CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 

TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 

GATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGATGGGGCTTTACGAAGCAGAATGGAGGGAA 

GATGTC TGAC AT ACT GCT GC AGGCGT C AGT CCAGGTC AT TGACAGC AC ACGGT GCAAT GC AG ACGATGCGT 

ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 

? ! GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 

]| CT GCGGGGGCC C GAGC AC CC CAGG AG TAT ACACC AAGGTC TC AGCC TAT C TCAACT GG ATC TAC AAT G TC T 

GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 

GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 

TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 

GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 

CAGGGGTATTGCTAAGCCAAGAAGGAACTTTCCCACACTACTGAATGGAAGCAGGCTGTCTTGTAAAAGCC 

CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 

GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 

ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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FIGURE 112 



MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPIIIALLSLASIIIVWLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE 113 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 

TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 

TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 

TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 

GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 

CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 

GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 

CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 

AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 

GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 

ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 

CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 

GAACGCCGGAGGAAAGCAAAGT GGCAGGGAAGGAAC T TGT GCCAAAT TATGGGT CAGAAAAGATG 

GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 

CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 

CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 

TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 

CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 

GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 

GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 

C T CAAT T T AAATCAT GT TC TAGTAAT TGGAGC TGTC CCCAAGACCAAAGGAGCTAGAGC T TGGT T 

CAAATGAT CT CCAAGGGC CC T T ATAC CCCAGGAGAC TT TGAT TTGAATT T GAAACCCCAAATC CA 

AACCTAAGT^ACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 

T TT GGGAGGCC GAGGCGGGTAGATCACCTGAGGTCAGGAGT TCAAGACCAGCC T GGCCAACATGG 

TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 

CCAGC T AC T CGGGAGGC T GAGACAGGAG AAT TAC T T GAACC TGGGAGG T GAAGGAGGC TGAGACA 

GGAGAATCACTTCAGCCTGAGCAACACAGCGAGAC TCTGTCTCAGAAAAAATAAAAAAAGAAT TA 

TGGTTATTTGTAA 
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FIGURE ua 

MLWWLVLLLLPTLKSVFCSLVT'SLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 



a 

CJ 

yj 
M 

ni 
m 



a 
m 

Q 

flj 
a 
m 
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FIGURE lis 

CAGCAGT GGT C T CT CAGT C C T C T CAAAGCAAGGAAAGAGTACT GT GT GCTGAGAGACCATGGCAA 
AGAAT CC TCCAGAGAAT TGTGAAGAC TGT CACAT TC TAAATGCAGAAGCT T TTAAATC CAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
T C TACAGC AAT GGAGAGAAGAAGAAG AT T TAC AT GGAAAT T GAT CC TGTG AC CAGAAC TGAAATA 
TTCAGAAGCGGAAATGGCAC TGATGAAACAT TGGAAGTGCACGACT TTAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
CTGAACCAGAAGAGGAAATAGAT GAGAAT GAAGAAAT TACCACAAC T TT C T T TGAACAGTCAG T G 
ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCT7^ATATCAGTTTCTGAGTTACAAG 
AC T T T GAGGAGGAGGGAGAAGAT C T T CAC T T T CC TGCCAAC GAAAAAAAAGGGAT T GAACAAAAT 
GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
AGAAC T T CCAATAAATGAC TAT AC T GAAAAT GGAATAGAAT T T GAT CCCATGC T GGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
T GC T GGCAAC ATAT AATAAAT GCAT GC T AT T C AATGAAT TT CT GCCTAT GAGGCATC TGGC CCC T 
GGTAGCCAGC T C T CCAGAAT TACT T GTAGGTAAT TC C TC TC T TCAT G T T CTAATAAACT TC TACA 
T TATCACC AAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGI^LTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYME IDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C- terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGG CT GCAT CG C GGCC AC CGGGATGGACAT GT GGAGC ACCC AGGACCTGT ACG AC AACCCC GTC ACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
jsje ; TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
P GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
Q TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
0! GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
Ul TGGGTCCAAC AC CAAAAACAAG AAGAT AT ACGATGG AGGT GCC C GC AC AG AGGACGAGGTAC AATCT T ATC 

CTTCCAAGCACGACTATGTGTAATGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 

Pi | 

: CC C AAAAAACAAGG AGAT CC CATCT AGATT T CT T CT TGCT T T TGAC T CAC AGCTGGAAGTTAGAAAAGCC T 

rfi 

CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
%z GCTGAGTT ATTT AT GAAT T AGAGGCT AT AGCT CACATT TT CAATCCTCT ATT T CT T TT T TT AAATAT AAC T 

I ft TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 
rh TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 
til GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
Q AC ACTT ACTG AAGAAGAAG C AAT AAG AG AAAGAT ATTT GT AAT C TCTCC AGCCC AT GAT CT C GG TT T T CTT 

til ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTC AG AAAT TGTCCC T AGAT GAAT G AGAAAATT AT T TT T TTTAAT TT AAGT CC TAAAT AT AGT TAA 
AATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGAAATAGCCTCACCCCTACATGTGGATAGAAG 
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
AT GAAAAGCT C ACACCTGT AAT CCT AGC ACTT TGGGAG GC TGAGGAGGAAGGATCACT T GAGCC CAGAAGT 
TC GAGACT AGCCTGGGC AAC AT GG AG AAGC CC TGT CT CTACAAAAT AC AGAGAGAAAAAAT C AGCC AGTC A 
TGGTGGCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGGAGGT 
TGGGGCTGCAGTGAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGATCCTGTCTAAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFT I LGLPAMLQAVRALMI VGIVLGAIGLLVS I FALKC I R IGSMEDSAKANMTLTSGIMFI VSGL 
CAI AGVS VFANMLVTNFWMS TANMY T GMGGMVQ T VQTR Y T FGAAL FVGWVAGGL TL I GGVMMC I A 

CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE no 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
U"; CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
bf ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
'r: C AGAT AC T C GATACC T T CC C AT CG CAC AAC C C AAAAAAG T TAT CACAC C GGAAAGAAGT C ACC GAGC G 

ill 

;. | TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
f f j ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
l|| AAT CTTAATTACAGGAACTGTGCATCAGCTAT TTATGAT TCTATAAGCTATTT CAGCAGAATGAGATA 

.8 T TAAACC CAAT G C TT T GAT T GT T C TAGAAAGT ATAG T AAT TTGTTTTC T AAGG TG GT T CAAGC AT CT A 

CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 

1 1! 

J; ACGAC ATAGCAT TAT GT ACAT AGAT GAGT G T AAC AT T TAT AT C T CACAT AGAGAC AT GC T TAT AT GG T 

TTTATTTAAAATGAAATGCCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA 
ATCATGGATAGGGTT GAAGAAGGT TACTATTAATT GTT T AAAAACAGCT TAGGGAT TAAT GTCCT CCA 
TTTATAAT GAAGATTAAAATGAAGGCTTT AATCAGCATT GTAAAGGAAATTGAAT GGCT T TCTGATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTT GTGTAT TAAATTAACAT TTTTAAAAC GCAGATATTTTGTCAAGGGGCT T TGCAT TCAAACTGCT T 
T T C C AGGG C TAT ACT CAGAAGAAAGAT AAAAG T GT GAT C T AAGAAAAAGTGAT GGT T T TAGGAAAGT G 
AAAAT ATT TTTGT TT TT GTATTT GAAGAAGAATGAT GCATT TTGACAAGAAATCATATATGTAT GGAT 
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT AC AT T T AT ATTAATAAAT TGT ACAT T TT T CTAATT 



n 
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FIGURE 120 



MAT HALE I AGL FLGGVGMVG TVAVTVMP QWRVS AF I ENN I WFENFWEGL WMNCVRQANI RMQCK 
I YDSLLALS PDLQAARGLMCAASVMS FLAFMMAI LGMKCTRCTGDNEKVKAHILLTAGI I FI I TG 
MWLIPVSWVANAI IRDFYNS IVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY 
RYS I PSHRT TQKS YHTGKKS PSVYSRS QYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT T CAAAGG AGAAAAGGGGGAAT G T C T G AGGGAAAGC T T T GAGGAGTCC T GGACACCCAAC T AC 
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
ATT GAAGCTATAAT TTAT TTGGACCAAGGAAGCCCTGAAATGAAT TCAACAATTAATATTCATCG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
AT T AT TGAAGAAC TACCAAAATAAATGCT TT AATT TTCAT T TGCTAC CT C T T T T TT TATTATGCC 
T TGGAAT GGT TCACTTAAATGACAT T T TAAATAAGTT TAT GTAT ACATC T GAAT GAAAAGCAAAG 
CTAAATATGTTTACAGACCAAAGTGTGATTTCACACTGTTTTTAAATCTAGCATTATTCATTTTG 
C T T CAAT CAAAAGT GGT TT CAAT AT T T T T T T T AGT T GGT TAGAATAC T T TC T T CATAGT CACAT T 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAGAATTTTTTTTATATCTGT 
TAAATAAAAAT TAT T T CCAACA 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 



APP ID=1 0063728 



Page 276 



FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACC TAAGGAC GACC T T TGAACAAT ACAAAGGATGGG T TT CAAT G 
TAAT TAGGCTACTGAGCGGATCAGC T GT AGCAC TGGT TATAGC CCCCAC T G TC T T AC TGACAAT G 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
^ ATAACAGCCTTCAAAAACTTAAGTATAATCAATTTAAAGGGCTCAACCAGCTCACCTGGCTATAC 
CI CTTGACCATAACCATATCAGCAATATTGACGAAAATGCTTTTAATGGAATACGCAGACTCAAAGA 
Q GCT GAT TC TT AGT TCCAATAGAAT CT CCTAT T T TC T TAACAAT AC CT T C AGAC CTGT GACAAAT T 

ff! TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
f \ CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
f\ AGAC TGCCGCAACC TGGAACT T TT GGACCT GGGATATAACCGGAT CC GAAGT T TAGCCAGGAAT G 

\l TCTTTGCTGG CAT GAT CAGAC T CAAAG AAC T T C AC C T G G AGCAC AAT CAAT T T T C C AAGC T CAAC 

ffj CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
ffi C ATAGGACAGACCATGT CCT GGACCTGGAGCT CCT TACAAAGGCT T GAT T TATCAGGCAATGAGA 

TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
T CCAACAAGC TCACAT T TAT T GGT CAAGAGAT T TT GGAT TC TT GGATAT CCCTCAATGACATCAG 
51 TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
jjl T TAAAGGT CT AAGGGAGAATACAAT TAT C TGTGCCAGTC CCAAAGAGC TGCAAGGAG TAAATGT G 

P| ATC GAT GCAG TGAAGAAC TACAGCATCT G TGGCAAAAGTACTACAGAGAGG T T T GAT C TGGCCAG 

7k GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
Q- TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
f|f CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAATGAC TCCCAGCACCCAGGAATT TTATGTA 
GATTATAZ^ACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCCTGCACCTA 
T AAC AAAT C G G G C T C C AG G GAG T G T G AGG T ATGAAC CAT T G T G AT AAAAAG AGC T C T T AAAAG C T 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCATT TTCC TCTCATACATAAT CAACCCAT TGAAATTT AAAT AC CAC AAT CAAT GTGAAGCT T 
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 
GT T T TAAGATAAAAC TT C T T TCAT AGGTAAAAAAAAAAA 
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FIGURE 124 



5 
m 

ill 

\\ amino acids 1-33 



MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARWFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKSFKGLRENTIICASPKELQGVWIDAVKNYSICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHI SFHKI IAGSVALFLSVLVI LLVI YVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMT PS TQE FYVDYKPTNTET S EMLLNGTGPC T YNKSGS RECEV 

Important features of the protein: 
Signal peptide: 



Transmembrane domain: 

s .. amino acids 420-442 



N-glycosylation sites. 

Q! amino acids 126-129, 357-360, 496-499, 504-507 

CI 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 12* 

CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
ATT GAAGAGGAAGGC T T T C TAAAGC TT T GGCAAGGAGTGACACCCGC CAT T TACAGACACG T AG T 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
TTAGCCAATCCAACTGACCTAGTGAAGGTTCAGATGCAAATGGAAGGAAAAAGGAAACTGGAAGG 
AAAACCAT TGCGAT T TCGTGGTGTACAT CATGCATTTGCAAAAATCT TAGCTGAAGGAGGAATAC 
GAGGGC T T T GGGCAGGC TGGG T ACCCAATATACAAAGAGCAGCACTGGT GAATATGGGAGAT T T A 
ACCAC TTATGATACAGTGAAACACTACT TGGTATTGAATACACCACTTGAGGACAATATCATGAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACTTTTGTATAAATCATCG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAG T CAG T CCAT TT TAA 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVVYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VIGGMMAGVIGQFLANPTDLVKVQMQMEGKRK^ 

NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 



n 



Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 



m 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGAGTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGACTGTGGGAGCTCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGT GGAT TTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 

MARPGMERWRDRIJUVTGASGGIGAAVARMaVQQGLKWGCARTVGNIEELAAECKSAGYPGTLI 
P YRCDLSNEE D I LSM FS AI RS QHS GVD I C im^GLJ^ 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
I SPGWE T Q FAFKLHDKDPEKAAAT YE QMKCLKPEDVAEAV I YVLS T PAH I Q I GD I QMRP TE QVT 



; t 



i \\ 

s~t. : 



Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28/ 24-30, 40-46, 90-96, 109-115, 



%l 199-205 

fli 



Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 

AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCC6GTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
TTAATTTTTAACTGATAGTTGTACATATTTGGGGGTACATGTGATATTTGGATACATGTATACAA 
TATATAAT GATCAAATCAGGGTAACTGGGATATCCATCACATCAAACATT TATT TTTTATTCTT T 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAAC TAAT T TTTGTAT TTT TAGTAGAGACGGGGT TTTGCCATGT TGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACGAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
T TCCATCCATGTTGCTGCAAATGACAGGAT T TCGT TC TTAATT TCAAT T AAAATAACCACACATG 
GCAAAAA 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHG T LGE S PMP FKRVFC QDGNVRS FCVCAVH FS S HQP PVAVE C LK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE 131 

TTCTGAAGTAACGGAAGCTACCTTGTATAAAGACCTCAACACTGCTGACC ATGA TCAGCGCAGCCTGGAGC 
AT CTTC CT C ATC GGGACT AAAATT GGGCT GTT C CTT CAAGTAGC ACCT CT AT CAGT T ATGGC T AAATC CT G 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT 
TTGAAAAACTTGCTGAAAGTAGAAAGAATATACCTATACCACAACAGTTTAGATGAATTTCCTACCAACCT 
CCC AAAGT ATGT AAAAGAGT TAC ATT TGC AAGAAAAT AAC AT AAGG ACT ATC AC TT AT GATT CACTTT CAA 
AAATTC CCT ATC TGGAAG AATT AC AT T TAG AT GACAAC T CTGTC TCTGCAGT T AGC AT AG AAG AGGG AGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GCC CAGGACT AT AG AAGAAC TACGCT T GGATGAT AATCGC AT AT CCACT ATTT C AT CACC AT CT CTT C AAG 
GTCTCACTAGTCTAAAACGCCTGGTTCTAGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAGTT 
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 
GATGATTTGGACAATATAACACAACTGATTCTTCGCAACAATCCCTGGTATTGCGGGTGCAAGATGAAATG 
GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 
AGGTTC GT GGGATG GCT ATT AAGG AT CTCAAT GC AGAACTGTTT GAT T GT AAGG AC AGT GGGAT TGT AAGC 
AC C ATT C AGAT AACCACT GC AATACC C AAC AC AGTGTAT CCT GCCC AAGG ACAGTGGCC AGC TC CAGT GAC 
CAAAC AGCCAGATATTAAGAACCCCAAGCT C ACT AAGG AT CAAC AAAC CACAGGGAGT C C CT C AAG AAAAA 
CAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCTATG 
ACTGCT TT GAGACT C AGCTG GCT T AAACT GGGCC AT AGCCCGGC AT TT GGAT CT AT AACAGAAACAAT TG T 
AAC AGGGGAAC G CAGT GAGT ACTT GGT C AC AGCC CT GGAGCC TG ATTCAC CCT AT AAAGT AT GC ATGGTT C 
CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 
CG AAT GTAC AAC CC TAC AAC CACCCT C AAT CGAGAG CAAGAG AAAGAACC T T AC AAAAACCCCAATT T AC C 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TT C AT AGG AATGGAT CGCTC TTC T CAAGGAACT GT GCATAT AGC AAAGGGAG GAGAAG AAAG GATGAC TAT 
GC AGAAGC TGGC AC T AAGAAGG AC AACT CT AT CC TGGAAAT C AGGG AAAC TT CTTT TCAGAT GT T ACC AAT 
AAGCAATG AACCC ATCTCGAAG GAGGAGTT T GT AAT AC AC ACCATAT T TC CTCC T AAT GG AATGAAT C TGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQ I NNAG IPS DLKNLLKVERI YL YHNS LDE FPTNLPK YVKE LHLQENNI RT I T YDS LS K I P YL 
EE LHLDDNS VS AVS I EEGAFRDSNYLRLL FLSRNHL S T I PWGLPRT IEELRLDDNR ISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KTOVRGLMCQAPEKVRGMAI KDLNAELFDCKDS G I VS T I Q I TT AI PNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAIIGGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCC^CTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
G CGC T GG AGAGAGC CACC GT C T TCC T AGAACAGAGGC T GCC T GAAAT CAAC C TGGATGGCAT GGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCAC T CC AACAGAGCC AGGAC TAT AT CAACCT CT T C TGCGCCAACAT GAT GGAC T TGAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGC AGGAAGGAT GC T T CGGGGAGCC T GATGCT GAAGAT GAAGAAT TAT CTAAAGC TAT TCAA 
TATCAGCAGC AT T T T TCGAGGAGAG TGAAGAGGCGAGAAAAACAAT TT CCAGAT TC T CGC TC TGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCAC CAAGCAGCC GC TGAGACGGACGGT TCCATGCCAGCT GC C TGGAGGAGGAACAGAC CCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134 



m 

m 
111 



MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKAT IADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCAJSIMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFY I FLVETGFHHVAHAGLELL I SRDPPTSGSQS VGL 



Q Important features of the protein: 

{ ?*.| 

Signal peptide: 

amino acids 1-26 

r|! Transmembrane domain: 

3| amino acids 39-56 



Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Ami da ti on site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 
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FIGURE 13S 

GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGC T T AC AGAT T T GAT CCC GT TC GAG T GGAT AT CAC T TCGAAAGGAAAAAT GAGAGCAAGATAT G 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATG7y\ATCTTCAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGACATGAGACGGGAAATGGAGCAGTCAATGAATATGCTGAATTCCAACCATGAGTTGCCTGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGG TCAATAT TGATGT CAC T GAAT TAATTACAGT GTCC TATAGAAAATGCCAT TAAT AAAT TAT 
ATGAACTACTATACATTATGTATAT TAATTAAAACATCT TAATCCAGAAATCAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE 136 

MAAZ\LWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGSFWHDIPSGSYWEWSPAYRFDPVRVDITSKGKMRARYWYIKTSE 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFMNPMVMMMVLPLLIFVLLPKV^ 
QSM3SIMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites . 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-23 6 
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FIGURE 137 

GATGGCGCAGCCACAGCTTCTG'TGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
« AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
j TGCAAGCCAAAGATGGAAACAT TGACAT CAGAATC TTAAGGAGGACTGAGTCTTTGCAAGACACA 

1 AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
| AAAC TAG C AGAC C C C T GAC CAT TAT AC T C T C C GGAAG AT C AGC AG C C T C G C CAAT TCCTTTCTTA 

:j CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
1 AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGC 
1 T T TGGGGGAAC TAGACAT TC T T CTGCAAT GGATGGAGGAGACAGA ATAG GAGGAAAG T GATGC T G 

CT GCTAAGAATATT CGAGGTCAAGAGCT CCAG TC TT CAATACC T GCAGAGGAGGCAT GACCCCAA 
' ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 

\ 

j CT TCC T TGCATGAT T GTC T T T AT GCATCCCCAATCT TAAT TGAGACCATAC T TG TATAAGAT TT T 

| T GTAATATCT TTCTGCTATTGGATATAT T TAT TAGT TAATATAT TTATTTATTT TTTGCTAT TTA 

1 AT G T AT T TAT T T T T T TAG T T G GAC AT GAAAC T T T AAAAAAAT T C AC AGAT TAT AT T T AT AAC C T G 

I AC TAGAGCAGGTGATGTAT TT T TAT ACAG TAAAAAAAAAAAACC TT GTAAAT TC TAGAAGAGTGG 

C TAGGGGGGT TATTCAT TTGT AT TCAAC TAAGGACATAT T TAC T CAT GCT GAT GC TC TGT GAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
S SREAWQ FFLLLWS PD FRPKMKAS S LAFS LLS AAFYLLWTP S TGLKTLNLGS CVI ATNLQE IRNG 
FSE IRGSVQAKDGN I D IR I LRRTE S LQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKI S 
SLANS FLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAWKALGELDILLQWMEET 
E 

Important features of the protein: 
Leit Signal peptide: 

0 amino acids 1-42 

7 1 cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

Uf 

h j ammo acids 192-195, 225-228 

"H 

EM N-myristoylation sites. 

amino acids 42-47, 4 6-51, 13 6-141 

1 V\ 

CIS 
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FIGURE 13Q 

CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCGGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCT GAAGCAAAGCCAC CT ACAG T TAC TAT GC CT CGAAT CAAGGCAT TGAT GACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
T AAGACAAGCAAAAGCAGC TGGAAAAAGAATAGTC T T TTATGGAGAT GAAACC T GGG T TAAAT TA 
T TCCCAAAGCAT T T T GT GGAATATGAT GGAACAACC TCAT TTTTCGTGT CAG AT TACACAGAGGT 
GGATAATAAT G TCACGAGGCAT T TGGATAAAGTAT TAAAAAGAGGAGAT T GGGACATAT TAAT CC 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGC TGAGCGAGAT GGACAGC GT GC TGATGAAGATCCACACCT CAC T GCAGT CGAAGGAGAGAGA 
GACGCC TT TACCCAAT T T GC T GGT TCT T T GT GG TGACCATGGCAT G T CT GAAACAGGAAGT CAC G 
GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
AC T T GGCT T ACCGAT TC CAAAAGACAGT GT AGGGAGCC TCCTAT TC CCAG T TGTGGAAGGAAGAC 
CAAT GAGAGAGCAGT TGAGATT T T TACAT T TGAATACAG TGCAGC T T AGTAAAC T GT TGCAAGAG 
AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
GAAC TGGAT CAGACT GTAC T TGGAGGAAAAGCAT T CAGAAGT CCTAT T CAACC TGGGC TCCAAGG 
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
CAAGGT GAT TG TAAAGAGCT GGCGGTCACAGAGGAACAAGCC CCCCAGC TGAGGGGGTGT G TGAA 
TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CT GCACACAG TATGTAGT T AC CAAAAGAATAAACGGCAATAAT T GAGAAAAAAAA 
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FIGURE 14Q 

MRLGS GT FATCCVAI E VLG I AVFLRGFFPAPVRS S ARAEHGAE PPAPE PS AGAS SNWTTL PPPL F 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNMV 
. TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide : 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-7 6 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAAC TGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 
CCGAAGAAAT TCAGCAT TCATGACCAGGAT CACAAAGTAC TGGT.CCTGGACTC TGGGAATCTCAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGT^GTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
T GT GACAAGGATAAAGGACAAAG T CAT C CAT CCC T TCAGC T GAAGAAGGAGAAAC T GATGAAGC T 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTT GGGGTGACAGATAAAT T T GAGAACAGGAAACACATTGAAT TT TCAT T TCAACCAGTT TGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
AT T T GAACTAAT TGTATAAAAACACCAAACC T GC TCAC T 
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FIGURE 142 

MLLLLLE YNFP I ENNCQHLKTTHT FRVKNLNPKKFS I HDQDHKVLVLDS GNL I AVPDKNYI RPE I 
FFAIASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

CAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site . 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE \a<\ 



CTAGAGAGTATAGGGCAGAAGGATGGCAGATGAGTGACTCCACATCCAGAGCTGCGTCCCTTTAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGT GGAGT T T GC TGTCCACACAT T CAACCAACAGAGCAAGGACTAC TATGCCTACAGACTGGGG 
U CACATCT T GAATTCC TGGAAGGAGCAGG T GGAG TCCAAGAC T G TATT C TCAATGGAGCTAC TGCT 

Q GGGGAGAACTAGGTGTGGGAAATTTGAAGACGACATTGACAACTGCCATTTCCAAGAAAGCACAG 
Jjj AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
'f ■ CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
%j GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
f|j TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
SO ACACAT C T T GAGCC T AATCATGT AGT GT AGATCAT TAAACAT CAGCAT T T TAAGAAAAAAAAAAA 

wl 
if! 

1 
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FIGURE iaa 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFTIEQRDCDEH]Sn7MARYLPATVEFAVHTFN0QSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 14* 

C TGT GCAGC T CGAGGC T C CAGAGGC ACAC T CC AGAGAGAGCC AAGGT T C T GACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GAC GAGGGGC AT C AAGC AC AGAAT CAAG T GGAACC GGAAGGCC C T GCC CAG CAC TGC CCAGAT CA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCGCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCAGGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG 
GTGTATTCCCGTTTTACAGATAAGGAAACTGAGGCCCAGAGAGCTGAAGTACTGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTGTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
C T C TAAT GAAAT T G T GAAAGC T C CAT G T T T AGAAATAAAT GAAAAC AC C T GA 
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FIGURE izl6 

MRKHLSWWWLATVCMLLFSHLSAVQTRGIPCHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE izt7 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 
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FIGURE 148 

MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 



Important features of the protein: 
Signal peptide: 

amino acids 1-25 



I** N-myristoylation site. 

til amino acids 62-68 

Q 

m 

UJ 
H 

ru 
m 



Q 
if! 

a 
m 

Q 

m 
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FIGURE iaq 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTT GAATGAACAGATTGCT GAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
TGT T GATAACTT GAACC TGC TAAAGGCAATAACAGAAAAGGAAAAAAT TGAGAAAGAAAGACAAT CT A 
TAAGAAGC T CCCCACTT GATAAT AAGT T GAAT GT GGAAGAT GT T GATT CAAC CAAGAAT C GAAAACT G 
ATCGATGATTATGACTCTACTAAGAGTGGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCA 
TCAACTAGACGGGACTCCTTTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAG 
AAAATGACAGAGCCGTGTTTGACAAGATT GTT TCTAAAC TACTTAATC T CGGC CT TATCACAGAAAGC 
CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCCAACAA 
TTATGAGGAGGAT CC CAATAAGC CCACAAGCT GGAC TGAGAAT CAGGCT GGAAAAAT AC CAGAGAAAG 
TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
TTAACCTTGACAAATGGCTTGGAAAGGAGAACTAAAACCTACAGTGAAGACAACTTTGAGGAACTCCA 
AT AT T TC CCAAAT TT CT AT GCGCTACTGAAAAGTAT TGATTCAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACT GAT TACTAT CATGAAAACAC TGATT GACTTT GT GAAGATGATGG T GAAATATGGAACAATATC T 
CCAGAAGAAGGT GTTT CCTACCTTGAAAACTT GGAT GAAAT GATT GCT CTT CAGACCAAAAACAAGCT 
AGAAAAAAATGC TACT GACAATATAAGCAAGCT TTT CCCAGCAC CAT CAGAGAAGAGTCAT GAAGAAA 
CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CATCAGAAAAAATATTGAAT GGT TGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTAT GACC TT T 
CAAAGATGAGAGACT T CAT CAATAAACAAGCT GAT GCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCT GAAAGTAAAGT T GTAT GTAAGCT GAAAAAAAAAAAAAAAAAAAA 
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FIGURE 1*0 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIASAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKIEKERQS IRSS PLDNKLNVEDVDSTKNRKL I DDYDS TKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKS I DSEKEAKEKETL I T IMKTL I DFVKMMVKYGT I S PEEGV 
SYLENLDEMIALQTKNKLEKNATDMISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 

NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCT TC TAGCTGGAGG GC T GCATGCAGGGAAGGTCAT TAAAGGT GAAGAGAT CAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
T GT CAGAC TCAC CCAGC T TCC CGAGAAT GGTGGC T GGAATGC C CCCAT CACAGAC TTCTAC T TCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
GAAATGGC T CGAGC TCAGAAGAT AAAAGATAAG TAGGGTATGC TGAT CCT CT TT TAAAAAC CCAA 
GATACAAT CAAAAT CCCAGAT GCT GGT CT CTAT TCCCATGAAAAAGT GCT CAT GACATAT T GAGA 
AGACCTACTTACAAAGTGGCATATATT GCAATT TAT TT TAATTAAAAGATACCTAT T TATATAT T 
TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
AT T CC T T GT ATAAAAATAAGAAAAGAAAT TAATC T TGAGGTAAGCAGAGCAGACATCATC T CT GA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
T TGTAG T AGTGAT CAGGAAACAGATC TCAGCAAAGC CAC T GAGGAGGAGGC TGTGC T GAGT TTG T 
GT GGC TGGAATC TC TGGGTAAGGAAC TTAAAGAACAAAAATCAT CT GGTAATT C T TT CC TAGAAG 
GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GT C T CT GCAGATG TAGT TAG T TAAGACAAGGT CAT GC TGGATGAAGGTAGACC TAAAT TCAATAT 
G AC TGGTTTCCTTG TAT GAAAAG GAG AG G AC AC AGAGAC AGAG GAG AC G C G G G G AAG AC TAT G T A 
AAGAT GAAGGCAGAGAT CGGAGT TT TGCAGCCACAAGCTAAGAAACACCAAGGAT T GT GGCAAC C 
AT CAGAAG C T T GGAAGAG GCAAAGAAGAAT T C T T C C C T AGAGGC T T T AGAG G GAT AAC GGC T C TG 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCCACCAAGGATAATTGGTTACAGCAGCTCTAGGAAACTAATACAGCTGCTAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAAT ATGGCAGAAGTGAT GGCAT GCCACT T CCAAGAT TAGGT TATAAAAGACAC T GCAGCTT C 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACAT TAGT GAACCTAGAAGCAGAGAC TC T G TGAGAT AAT CGATG TT T GTTGTTTT 
AAGTTGCTCAGTTTTGGTCTAACTTGTTATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 152 

MVLSGALCFT^KDS ALKVL YLHNNQLLAGGLHAGKVIKGEE I S WPNRWLDASLS PVI LGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 



Interleukin-1 signature. 

amino acids 111-131 



Interleukin-1 proteins. 

I , amino acids 8-29, 83-120, 95-134, 64-103 



m 



£1 
111 

Hz* 
hrJ 



ill 
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FIGURE 153 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCA ATG GCCGC 
CCTGCAGAAA.TCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT TGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAA&GCT GAAGGACACAGTGAAAAAGC 
T T GGAGAGAGTGGAGAGATCAAAGCAAT TGGAGAAC TGGAT T T GC TGT T T AT GT CT C TGAGAAAT 
GCCTGCATT TGAC CAGAGCAAAGC T GAAAAATGAATAAC TAACCCCCTTTCCCTGC TAGAAATAA 
CAATTAGATGCCCCAAAGCGATTTTTTTTAACCAAAAGGAAGATGGGAAGCCT^CTCCATCATG 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGAC CAGAAGGTAGAC T TTC TAAGC AT AGAT AT T TAT TGATAACAT T TCAT TGTAAC TGGTGT T C 
TATACACAGAAAACAAT TTAT TT TT TAAATAAT TG TC T T TT T CCATAAAAAAGAT TACT TT CCAT 
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
TGTATTTATTATTATTATAAGACTGCATT TTAT TTATATCAT TTTATTAATATGGATTTAT TTAT 
AGAAAC AT CAT T C GAT AT TG C T AC T T GAG T G T AAGGC T AATAT T GATAT T TAT GACAAT AAT TAT 
AGAGCTATAACATG TT TAT T TGACC T CAAT AAACAC TT GGATAT CCC 
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FIGURE 154 

MAALQKS VS S FLMGTLATSCLLLLALLVQGGAAAP IS SHCRLDKSNFQQPYI TNRTFMLAKEASL 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 



APP_ID=1 0063728 



Page 308 of 322 



FIGURE 155 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAG G GAGCGAC CCAGAT TAGGT GAGGACAG T TC TCT CAT TAGCC TT T T CC TACAGGT GGT T GCAT 
TGTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
; GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
V ? | CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
Q TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
£fl GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
Ul AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 

ill 

fx ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
a CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAAGCTGCACTTCTGCACATTTTGAAAAGAG 
Q CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 

GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
% GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
P% TGTTTGTT TAG T CAT CAC T CAG T GAGCAT C T AC T T TGGGTGCAT TC TAG T G T AG TT AC TAG T C T T 

f|| TTGACATGGATGATTCTGAGGAGGAAGCTGTTATTGAATGTATAGAGATTTATCCAAATAAATAT 

CTTTATTTAAAAATGAAAAA 
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FIGURE 156 

MRERPRLGEDSSLISLFLQWAFIJyyrVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVP 

RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 

ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites - 

amino acids 44-50, 150-156 
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FIGURE iR7 



CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
C C G GAGAC T T GAGGGAC C T C C GAG T AGAAC C T G T TACAAC TAG T G T T GC AAC AGGGGAC TAT T CA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGC TGAACACAGTC TAT T TCAT TGGGGCCCATAATAT T CCTAATGCAAATAT GAATGAAGAT GG 
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
AG T GTGTCAAGGCC GGAAGCC TGTGGGAT CCGAACATCACTGC T T GTAAGAAGAATGAGGAGACA 
GT AGAAGT GAAC T T CACAACC AC T CCC CTGGGAAACAGATAC AT G GC T C T TAT CC AACAC AGCAC 
£P TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
l fi t TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
P I{ AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
m GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
a CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
Q T CT ACCACCACACTAC TGCCCCCCAT TAAGGT TC T T GTGGT T TAC C CAT C T GAAATAT GT T TC CA 

J£j TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 
% AGT GGCAGAAAAAGAAAATAGCAGAGAT GGGT CCAG TGCAGTGGC T T GCCACT CAAAAGAAGGCA 

fi| GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
f|J GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGATCTAAGAAGCCAGATTCATCTGCACAAATACGTGGTGGTCTACTTTAGAGAGATTGATACA 
AAAGAC GAT TACAATGC TCTCAGTGTCT GCC CCAAGTAC CAC C TCAT GAAGGATGCCAC TGCT T T 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWI4LQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSY-SCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSVNFTSPGCLDHIMKYKKKCV 

STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 
QNHCRSEV ILE KWQ KKKIAEMGPVQWIATQKKAADKWFLLSNDVN S VCDGTCGKS EGSP S ENS QDL FPL A 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 



W Signal peptide: 

rfl 

amino acids 1-14 



S\ 

ml Transmembrane domain: 

; -it 

fi\ amino acids 290-309 

Pi N-glycosylation sites. 

Lf| amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 



287 



cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 



Amidation site. 

amino acids 488-452 
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FIGURE l*Q 

AGCCAC C AG C GCAAC ATGA C AG T G AAGACC CT GCATGGCCCAGC CAT GG T CAAG TAC T TGC TGC T 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
AC T CAG C T G AAGAAG 
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FIGURE 160 



MTVKTLHGPAMVKYLLLS ILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGI INEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 




N-myristoylation sites. 

amino acids 10 6-111, 136-141 



a 
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FIGURE 161 



ACACTGGCCAAACAAMACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAG ATG CCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGC CTGAAGATGAGGAAAAGT T TGGAGGAGCAGC TGAC T CAGGGG T GGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
. . TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
H TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
£!f GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
Q ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
f% GTCCAGGGCCCCCCAAAACCCCGGTGGCACT^AAAACCTGACTGGACCGCAGATCATTACCTTGAA 
?\\ CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
y f CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
%l CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
f|| GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
fl\ TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
W AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
CI GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
i|| CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
Q CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
f|| CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
. CGGGAC T TAAA TAAAGGCAGAC GC TGTTTTTC TAAAAAAA 
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FIGURE 162 

MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVIAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQVVLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSWYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDK7LEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGWVLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
P LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

3] Signal sequence: 

ill amino acids 1-20 

fit 

* w Transmembrane domain . 

m 

ammo acids 453-475 
ffj . N-glycosylation sites. 

r| amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 

f|f 334-337, 357-360, 391-394 

Q 

f|| Glycosaminoglycan attachment site. 

amino acids 583-586 

CAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 



N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T TC TAC CACT TAGAGCT CCAGGT CAACCGCACCTAC CAAATGCACC T TGGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCT GGGCCAAGGAGAGT GCCCCCTACAT GTGCCGAG TGAAGACAC T GCCAGACCGGACAT GGACC 
U TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
f?* CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
b:! TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
W AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
Q\ TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 
hi CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
t m \ GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
J* GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
!*W CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
Hi CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
CCCTGGGGAT T TGCACAGACAGAACATCT GACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA 
%l CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
C| TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
f{| GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
ATCCCATGCCTGGCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
1H AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAG T GCAAGGAGGAAAT GCAGGGAAAC T CCCGAGGT CCAGAGC CCCACC T C 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAATGCAG GTCCACCAGGGAGGGAGACACACAAGCC T T T T C T GCAGGCAG GAGT T TCAGACC CT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE i6a 



MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
E FLG T IM I CVP TWAKE S AP YMCRVKT LP DRT W T YS FSGAFL FSMGFLVAVLCYL S YR YVTKP PAP 
PNSLNVQRVLT FQPLRF I QEHVL I PVFDLSGPS S LAQPVQYS QI RVS GPRE PAGAPQRHSLSE I T 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 

TGGCCTACTGGAAAAAAAAAiWyy^AAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAAC^TGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGTTGAACACAGTTTTGGATAT T TTCCAAAAGAT T TGATCAAGGTACT TCATAAATACACGGA 
AGAAGAGCT ACATAT TC CAGCAGAT GAGACAGAC T T TG T C TGCT TTGAAGGAGGAAGAGATGATT 
TTAATAGTTATAATGTAGAAGAGCT TT TAGGATCT TTGGAACTGGAGGACTCTGTACCTGAAGAG 
TCGAAGAAAGC T GAAGAAG T T T C T CAG CACAGAGAGAAATC TC C T GAGGAGTC T CGGGGGCGT GA 

ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCAC CGAGGGGC TGCAGGGACAGC CC TC AGC TCAGGAGAGC CAC CC TCAC ACCAGC GGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGTTATT 
CAT T AC AGC AAAGGAT T TC G T T GG CAT CAAAAT C T AAG T T T GT T T T AC AAAGAT T G T T T T TAG TA 

CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 

"I 
1 
I 
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FIGURE 166 



'MAAAPGLLFWLFVLGALWWP'GQSDLSHGRRFSDLKVCGDEEGSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKT DAYKVLKTEMSQRGSGQCVI HYSKGFRWHQNLSLFYKDC F 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTtCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
i ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
I TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
J ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
! CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
J CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
I GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
J ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
1 GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA 
CCAGAAAAGAAAGTGGCATGGAAGT ATAA TAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 
) GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
! AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
I TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
1 ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 
I TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
I TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAACT TAT TACTGTTGTCAACTGAAT TCACACACACACAAATATAGTACGATAGAAAAAGT TTGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAGAAGCTATTTCATTAAGTGTGATATAAACCTCCTCAAACATTTTACTTAGAGGCAAGGAT 
TGTCTAATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTAA?LATTAAACAGATT 
TTGTAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAA.GAAG 
T G AC A T AC AC AAT AT AAAT CAT AT G T C T T C AC AC G T TGCCTATATAATGAGAAGCAGCT CT CTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAAATGGGTTGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAZU\AAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVS FQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTA 
PYLTNQPGDTHQNWVTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
S TLW I S KS TRKE S GMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

w 

;P Transmembrane domain: 

m 

T\ amino acids 214-235 

i 

rli 

m N-glycosylation sites. 

« amino acids 86-89, 255-258 

*J cAMP- and cGMP-dependent protein kinase phosphorylation site. 

?n amino acids 266-269 

JiJ 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
' 145, 212-217 
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